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GRAPEVINE LEAFROLL VIRUS (TYPE 2) PROTEINS AND THEIR USES 

This application claims the benefit of U.S. Provisional Patent Application 
Serial No. 60/047,194, filed May 20, 1997. This work was supported by the U.S. Department 
5 of Agriculture Cooperative Grant No. 58-2349-9-01 . The U.S. Government may have certain 
rights in the invention. 

FIELD OF THE INVENTION 

]0 The present invention relates to grapevine leafroll virus (type 2) proteins, 

DNA molecules encoding these proteins, and their uses. 

BACKGROUND OF THE INVENTION 

1 5 The world's most widely grown fruit crop, the grape (Vitis sp.\ is cultivated 

on all continents except Antarctica. However, major grape production centers are in 
European countries (including Italy, Spain, and France), which constitute about 70% of the 
world grape production (Mullins et al.. Biology of the Grapevine , Cambridge, 
U.K.:University Press (1992)). The United States, with 300,000 hectares of grapevines, is the 

20 eighth largest grape grower in the world. Although grapes have many uses, a major portion 
of grape production (-80%) is used for wine production. Unlike cereal crops, most of the 
world's vineyards are planted with traditional grapevine cultivars, which have been 
perpetuated for centuries by vegetative propagation. Several important grapevine virus and 
virus-like diseases, such as grapevine leafroll, corky bark, and Rupestris stem pitting, are 

25 transmitted and spread through the use of infected vegetatively propagated materials. Thus, 
propagation of certified, virus-free materials is one of the most important disease control 
measures. Traditional breeding for disease resistance is difficult due to the highly 
heterozygous nature and outcrossing behavior of grapevines, and due to polygenic patterns of 
inheritance. Moreover, introduction of a new cultivar may be prohibited by custom or law. 

30 Recent biotechnology developments have made possible the introduction of special traits, 
such as disease resistance, into an established cultivar without altering its horticultural 
characteristics. 

Many plant pathogens, such as fungi, bacteria, phytoplasmas, viruses, and 
nematodes can infect grapes, and the resultant diseases can cause substantial losses in 
35 production (Pearson et al., Compendium of Grape Diseases . American Phytopathological 
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Society Press ( 1 988)). Among these, viral diseases constitute a major hindrance to profitable 
growing of grapevines. About 34 viruses have been isolated and characterized from 
grapevines. The major virus diseases are grouped into: (1) the grapevine degeneration caused 
by the fanleaf nepovirus, other European nepoviruses, and American nepoviruses, (2) the 
5 leafroll complex, and (3) the rugose wood complex (Martelli, ed., Graft Transmissible 
Diseases of Grapevines. Handbook for Detection and Diagnosis , FAO, UN, Rome, Italy 
(1993)). 

Of the major virus diseases, the grapevine leafroll complex is the most widely 
distributed thioughout the world. According to Goheen ("Grape Leafroll," in Frazier et aL, 

10 eds.. Virus Diseases of Small Fruits and Grapevines (A Handbook) , University of California, 
Division of Agricultural Sciences, Berkeley, Calif, USA, pp. 209-212 (1970) ("Goheen 
(1970)"), grapevine leafroIMike disease was described as early as the 1850s in German and 
French literature. However, the viral nature of the disease was first demonstrated by Scheu 
(Scheu, "Die Rollkrankheit des Rebstockes (Leafroll of grapevine)," D. D. Weinbau 14:222- 

15 358 (1935) ("Scheu (1935)")). In 1946, Harmon and Snyder (Harmon et al., "Investigations 
on the Occurrence, Transmission, Spread and Effect of 'White' Fruit Colour in the Emperor 
Grape," Proc. Am. Soc. Hort. Sci. 74:190-194 (1946)) determined the viral nature of White 
Emperor disease in California. It was later proven by Goheen et al. (Goheen et al., "Leafroll 
(White Emperor Disease) of Grapes in California, Phytopathology , 48:51-54 (1958) 

20 ("Goheen (1958)")) that both leafroll and "White Emperor" diseases were the same, and only 
the name "leafroll" was retained. 

Leafroll is a serious viral disease of grapes and occurs wherever grapes are 
grown. This wide distribution of the disease has come about through the propagation of 
diseased vines. It affects almost all cultivated and rootstock varieties of Vitis, Although the 

25 disease is not lethal, it causes yield losses and reduction of sugar content. Scheu estimated in 
1936 that 80 per cent of all grapevines planted in Germany were infected (Scheu, Mein 
Winzerbuch , Berlin:Reichsnahrstand-Verlags (1936)). In many California wine grape 
vineyards, the incidence of leafroll (based on a survey of field symptoms conducted in 1959) 
agrees with Scheu's initial observation in German vineyards (Goheen et al., "Studies of 

30 Grape Leafroll in California," Amer. J. Enol. Vitic , 10:78-84 (1959)). The current situation 
on leafroll disease does not seem to be any better (Goheen, "Diseases Caused by Viruses and 
Viruslike Agents," The American Phytopathological Society , St. Paul, Minnesota: APS Press, 
1:47.54 (1988) ("Goheen (1988)"). Goheen also estimated that the disease causes an annual 
loss of about 5-20 per cent of the total grape production (Goheen (1970) and Goheen (1988)). 
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The amount of sugar in individual berries of infected vines is only about 1/2 to 2/3 that of 
berries from noninfected vines (Goheen (1958)). 

Symptoms of leafroll disease vary considerably depending upon the cultivar, 
enviroiunent, and time of the year. On red or dark-colored fruit varieties, the typical 
5 downward rolling and interveinal reddening of basal, mature leaves is the most prevalent in 
autumn; but not in spring or early summer. On light-colored fruit varieties however, 
; symptoms are less conspicuous, usually with downward rolling accompanied by interveinal 

chlorosis. Moreover, many infected rootstock cultivars do not develop symptoms. In these 
cases, the disease is usually diagnosed with a woody indicator indexing assay using Vitis 

1 0 vivifera cv, Carbernet Franc (Goheen (1 988)). 

Ever since Scheu demonstrated that leafroll was graft transmissible, a virus 
etiology has been suspected (Scheu (1935)). Several virus particle types have been isolated 
from leafroll diseased vines. These include potyvirus-like (Tanne et al., "Purification and 
Characterization of a Virus Associated with the Grapevine Leafroll Disease," 

15 Phytopathology , 67:442-447 (1977)), isometric virus-like (Castellano et al./'Virus-like 

Particles and Ultrastructural Modifications in the Phloem of Leafroll-affected Grapevines," 
Vitis , 22:23-39 (1983) ("Castellano (1983)") and Namba et al., "A Small Spherical Virus 
Associated with the Ajinashika Disease of Koshu Grapevine, Ann. Phvtopathol. Soc. Japan . 
45:70-73 (1979)), and closterovirus-like (Namba, "Grapevine Leafroll Virus, a Possible 

20 Member of Closteroviruses, Ann. Phvtopathol. Soc. Japan , 45:497-502 (1979)) particles. In 
recent years, however, long flexuous closteroviruses ranging from 1,400 to 2,200 nm have 
been most consistently associated with leafroll disease (Figure 1) (Castellano (1983), Faoro 
et al., "Association of a Possible Closterovirus with Grapevine Leafroll in Northern Italy," 
Riv. Patol. Veg.. Ser IV . 17:183-189 (1981), Gugerh et al., "L'enroulement de la vigne: mise 

25 en evidence de particules virales et developpement d'une methode immuno-enzymatique 

pour le diagnostic rapide (Grapevine Leafroll: Presence of Virus Particles and Development 
of an Immuno-enzyme method for Diagnosis and Detection)," Rev. Suisse Viticult. 
Arboricuh. Hort. . 16:299-304 (1984) ("Gugerli (1984)"), Hu et al., "Characterization of 
Closterovirus-like Particles Associated with Grapevine Leafroll Disease," J. Phvtopathol. , 

30 128:1-14 (1990) ("Hu (1990)"), Milne et al., "Closterovirus-like Particles of Two Types 
Associated with Diseased Grapevines," Phvtopathol. Z. , 1 10:360-368 (1984), Zee et al., 
"Cytopathology of Leafroll-diseased Grapevines and the Purification and Serology of 
Associated Closteroviruslike Particles," Phytopathology . 77:1427-1434 (1987) ("Zee 
(1987)"), and Zimmermann et al., "Characterization and Serological Detection of Four 
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Closterovirus-like Particles Associated with Leafroll Disease on Grapevine," J. Phytopathol. . 
130:205-218 (1990) ("Zimmermann (1990)")). These closteroviruses are referred to as 
grapevine leafroll associated viruses ("GLRaV"). At least six serologically distinct types of 
GLRaV's (GLRaV"l to -6) have been detected from leafroll diseased vines (Table 1) (Boscia 
5 et al., "Nomenclature of Grapevine Leafroll-associated Putative Closteroviruses, Vitis , 
34:171-1 75 (1995) ("Boscia (1995)") and (Martelli, "Leafroll," pp. 37-44 in Martelli, ed., 
Graft Transmissible Diseases of Grapevines, Handbook for Detection and Diag nosis. FAO, ^ 
Rome Italy, (1993) ("Martelli I")). The first five of these were confirmed in the 10th 
Meeting of the International Council for the Study of Virus and Virus Diseases of the 
1 0 Grapevine ("ICVG") (Volos, Greece, 1 990). 



TABLE 1 



Type 


Particle length 
(nm) 


Coat 

protein Mr 
(XI 0^) 


Reference 


GLRaV-l 


1,400-2,200 


39 


Gugerli (1984) 


GLRaV-2 


1,400-1,800 


26 


Gugerli (1984) 
Zimmermann (1990) 


GLRaV-3 


1,400-2,200 


43 


Zee (1987) 


GLRaV-4 


1,400-2,200 


' 36 


Hu(1990) 


GLRaV-5 


1,400-2,200 


36 


Zimmermann (1990) 


GLRaV-6 


1,400-2,200 


36 


Gugerli (1993) 



15 Through the use of monoclonal antibodies, hov^ever, the original GLRaV 11 described in 

Gugerli (1 984) has been shovm to be an apparent mixture of at least two components, Ila and 
lib (Gugerli et al., "Grapevine Leafroll Associated Virus II Analyzed by Monoclonal 
Antibodies," 1 1th Meeting of the International Council for the Study of Viruses and Virus 
Diseases of the Grapevine . Montreux, Sv^itzerland, pp. 23-24 (1993) ("Gugerli (1993)")). 

20 Recent investigation with comparative serological assays (Boscia (1995)) demonstrated that 
the lib component of cv, Chasselas 8/22 is the same as the GLRaV -2 isolate from France 
(Zimmermann (1990)) which also include the isolates of grapevine corky bark associated 
closteroviruses from Italy (GCBaV-BA) (Boscia (1995)) and from the United States 
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(GCBaV-NY) (Namba et al., 'Turification and Properties of Closlerovirus-like Particles 
Associated with Grapevine Corky Bark Disease," Phvtopatholopv . 81 :964"970 (1991) 
("Namba (1991)")). The Ila component of cv. Chasselas 8/22 was given the provisional 
naane of grapevine leafroll associated virus 6 (GLRaV-6). Furthermore, the antiserum to the 
5 CA-5 isolate of GLRaV-2 produced by Boscia et al. (Boscia et al., "Characterization of 

Grape Leafroll Associated Closterovirus (GLRaV) Serotype II and Comparison with GLRaV 
I Serotype III," Phytopathology, 80:1 17 (1990)) was shown to contain antibodies to both 

GLRaV-2 and GLRaV- 1, with a prevalence of the latter (Boscia (1995)). 

Virions of GLRaV-2 are flexuous, filamentous particles about K400-L800 nm 

1 0 in jength (Gugerli et al., "L'enroulement de la Vigne: Mise en Evidence de Particles Virales 
et Development d'une Methode Immuno-enzymatique Pour le Diagnostic Rapide (Grapevine 
Leafroll: Presence of Virus Particles and Development of an Immuno-enzyme Method for 
Diagnosis and Detection)," Rev. Suisse Viticult. Arboricuh. Horticult. 16:299-304 (1984)). 
A double-stranded RNA (dsRNA) of about 15 kb was consistently isolated from GLRaV-2 

15 infected tissues (Goszczynski et al., "Detection of Two Strains of Grapevine Leafroll- 

Associaled Virus 2," Vitis 35:133-35 (1996)). The coat protein of GLRaV-2 is ca 22'-'26 kDa 
(Zimmermami et al., "Characterization and Serological Detection of Four Closterovirus-like 
Particles Associated with Leafroll Disease on Grapevine," J. Phvtopathologv 130:205-18 
(1990); Gugerli and Ramel, Extended abstracts: "Grapevine Leafroll Associated Virus II 

20 Analyzed by Monoclonal Antibodies," 11 th ICVG at Montreux, Switzerland, Gugerli, ed.. 
Federal Agricultural Research Station of Changins, CH-1260 Nyon, Switzerland, p. 23-24 
(1993); Boscia et ah, "Nomenclature of Grapevine Leafroll-Associated Putative 
Closleroviruses," Vitis 34:171-75 (1995)), which is considerably smaller than other GLRaVs 
(35-43 kDa) (Zee et al., "Cytopathology of Leafroll-Diseased Grapevines and the 

25 Purification and Serology of Associated Closterovirus Like Particles," Phvtopathologv 
77:1427-34 

(1987); Hu et al., "Characterization of Closterovirus-Like Particles Associated with 
Grapevine Leafroll Disease," J. of Phytopathology 128:1-14 (1990); Ling et al., "The Coat 
Protein Gene of Grapevine Leafroll Associated Closteroyirus-3: Cloning, Nucleotide 
30 Sequencing and Expression in Transgenic Plants," Arch, of Virology 142:1 101-16 (1997)). 
Although GLRaV-2 has been classififed as a member of the genus Closterovirus based on 
particle morphology and cytopathology (Martelli, Circular of ICTV-Plant Virus 
Subcommittee Study Group on Closterolike Viruses" (1996)), its molecular and biochemical 
properties are not well characterized. 
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In the closterovirus group, several viruses have recently been sequenced. The 
partial or complete genome sequences of beet yellows virus (BYV) ( Agranovsky et al. 
"Nucleotide Sequence of the 3'-Terminal Half of Beet Yellows Closterovirus RNA Genome 
Unique Arrangement of Eight Virus Genes," J. General Virology 72:15-24 (1991); 
5 Agranovsky et al., "Beet Yellows Closterovirus: Complete Genome Structure and 

Identification of a Papain-like Thiol Protease," Virology 198:31 1-24 (1994)), beet yellow 
stunt virus (BYSV) (Karasev et al., "Organization of the 3*-Terminai Half of Beet Yellow 
Stunt Virus Genome and Implications for the Evolution of Closteroviruses," Virology 
22 1 : 1 99-207 (1 996)), citrus tristeza virus (CTV) (Pappu et al., "Nucleotide Sequence and 

10 Organization of Eight 3* Open Reading Frames of the Citrus Tristeza Closterovirus Genome," 
Virology 199:35-46 (1994); Karasev et al., "Complete Sequence of the Citrus Tristeza Virus 
RNA Genome," Virology 208:51 1-20 (1995)), lettuce infectious yellows virus (LIYV) 
(Klaassen et al., "Partial Characterization of the Lettuce Infectious Yellows Virus Genomic 
RNAs, Identification of the Coat Protein Gene and Comparison of its Amino Acid Sequence 

1 5 With Those of Other Filamentous RNA Plant Viruses," J. General Virology 75:1525-33 

(1994) ; Klaassen et al., "Genome Structure and Phylogenetic Analysis of Lettuce Infectious 
Yellows Virus, a Whitefly-Transmitted, Bipartite Closterovirus," Virology 208:99-1 10 

(1995) ), little cherry virus (LChV) (Keim and Jelkmann, "Genome Analysis of the 3'- 
Terminal Part of the Little Cherry Disease Associated dsRNA Reveals a Monopartite 

20 Clostero-Like Virus," Arch. Virology 141:1437-51 (1996); Jellonann et al., "Complete 
Genome Structure and Phylogenetic Analysis of Little Cherry Virus, a Mealybug- 
Transmissible Closterovirus," J. General Virology 78:2067-71 (1997)), and GLRaV-3 (Ling 
et al., "Nucleotide Sequence of the 3' Terminal Two-Thirds of the Grapevine Leafroll 
Associated Virus-3 Genome Reveals a Typical Monopartite Closterovirus," J. Gen. Virology 

25 79(5):1289-1301 (1998)) revealed several common features of the closteroviruses, including 
the presence of HSP70 chaperone heat shock protein and a duplicate of the coat protein gene 
(Agranovsky "Principles of Molecular Organization, Expression, and Evolution of 
Closteroviruses: Over the Barriers," Adv. in Virus Res. 47:1 19-218 (1996); Dolja et al. 
^'Molecular Biology and Evolution of Closteroviruses: Sophisticated Build-up of Large RNA 

30 Genomes," Annual Rev, Photopathology 32:261-85 (1994); Boyko et al., "Coat Protein Gene 
Duplication in a Filamentous RNA Virus of Plants," Proc. Nat. Acad. Sci. USA 89:9156-60 
(1 992)). Characterization of the genome organization of GLRaVs would provide molecular 
information on the serologically distinct closteroviruses that cause similar leafroll symptoms 
in grapevine. 
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Several shorter closteroviruses (particle length 800 nm long) have also been 
isolated from grapevines. One of these, called grapevine virus A ("GVA") has also been 
found associated, though inconsistently, with the leafroll disease (Agran et al., "Occurrence 
of Grapevine Virus A (GVA) and Other Closteroviruses in Tunisian Grapevines Affected by 
5 Leafroll Disease," Vitis, 29:43-48 (1990), Conti, et al., "Closterovirus Associated with 
Leafroll and Stem Pitting in Grapevine," Phytopathol. Mediterr. . 24: 11 0-1 13 (1 985), and 
Conti et al., "A Closterovirus from a Stem-pitting-diseased Grapevine," PhvtoDatholopy . 
70:394-399 (1 980)). The etiology of GVA is not really known; however, it appears to be 
more consistently associated with rugose wood sensu lato (Rosciglione at ai., "Maladies de 

1 0 Tenroulement et du bois strie de la vigne: analyse microscopique et serologique (Leafroll and 
Stem Pitting of Grapevine: Microscopical and Serological Analysis)," Rev. Suis.se Vitir 
Arboric. Hortic, 18:207-211 (1986) ("Rosciglione (1986)"), and Zimmermann (1 990)). 
Moreover, another short closterovirus (800 nm long) named grapevine virus B ("GVB") has 
been isolated and characterized from corky bark-affected vines (Boscia et al., "Properties of a 

1 5 Filamentous Virus Isolated from Grapevines Affected by Corky Bark," Arch. Virol. . 
1 3 0: 1 09- 1 20 ( 1 993) and Namba ( 1 991 )). 

As suggested by Martelli I, leafroll symptoms may be induced by more than 
one virus or they may be simply a general plant physiological response to invasion by an 
array of phloem-inliabiting viruses. Evidence accumulated in the last 15 years strongly 

20 favors the idea that grapevine leafroll is induced by one (or a complex) of long 
closteroviruses (particle length 1,400 to 2,200 nm). 

Grapevine leafroll is transmitted primarily by contaminated scions and 
rootstocks. However, under field conditions, several species of mealybugs have been shown 
to be the vector of leafroll (Engelbrecht et al., "Transmission of Grapevine Leafroll Disease 

25 and Associated Closteroviruses by the Vine Mealybug Planococcus-ficus," Phytophvlactica . 
22:341-346 (1990), Rosciglione, et al., "Transmission of Grapevine Leafroll Disease and an 
Associated Closterovirus to Healthy Grapevine by the Mealybug Planococcus ficus," 
(Abstract), Phvtoparasitica . 17:63-63 (1989), and Tanne, "Evidence for the Transmission by 
Mealybugs to Healthy Grapevines of a Closter-like Particle Associated with Grapevine 

30 Leafroll Disease," Phvtoparasitica . 16:288 (1988)). Natural spread of leafroll by insect 
vectors is rapid in various parts of the world. In New Zealand, observations of three 
vineyards showed that the number of infected vines nearly doubled in a single year (Jordan et 
al., "Spread of Grapevine Leafroll and its Associated Virus in New Zealand Vineyards," 1 1th 
Meeting of the International Council for the Study of Viruses and Virus Diseases of the 
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Grapevine , Montreux, Switzerland, pp. 113-1 14 (1993)). One vineyard became 90% infected 
5 years after GLRaV-3 was first observed. Prevalence of leafroll worldwide may increase as 
chemical control of mealybugs becomes more difficult due to the unavailability of effective 
insecticides. 

In view of the serious risk grapevine leafroll virus poses to vineyards and the 
absence of an effective treatment of it, the need to prevent this affliction continues to exist. 
The present invention is directed to overcoming this deficiency in the art. 

SUMMARY OF INVENTION 

The present invention relates to an isolated protein or polypeptide 
corresponding to a protein or polypeptide of a grapevine leafroll virus (type 2). The encoding 
RNA and DNA molecules, in either isolated form or incorporated in an expression system, a 
host celL a transgenic Vitis or citrus scion or rootstock cultivar, or a transgenic Nicotiana 
plant or beet plant are also disclosed. 

Another aspect of the present invention relates to a method of imparting 
grapevine leafroll virus (type 2) resistance to Vitis scion or rootstock cultivars or Nicotiana 
plants by transforming them with a DNA molecule encoding the protein or polypeptide 
corresponding to a protein or polypeptide of a grapevine leafroll virus (type 2). Other aspects 
of the present invention relate to a method of imparting beet yellows virus resistance to beet 
plants and a method of imparting tristeza virus resistance to citrus scion or rootstock 
cultivars, both by transforming the plants or cultivars with a DNA molecule encoding the 
protein or polypeptide corresponding to a protein or polypeptide of a grapevine leafroll virus 
(type 2). 

The present invention also relates to an antibody or binding portion thereof or 
probe which recognizes the protein or polypeptide. 

Grapevine leafroll virus resistant transgenic variants of the current commercial 
grape cultivars and rootstocks allows for more complete control of the virus, while retaining 
the varietal characteristics of specific cultivars. Furthermore, these variants permit control of 
GLRaV-2 transmitted either by contaminated scions or rootstocks or by a presently 
uncharacterized insect vector. With respect to the latter mode of transmission, the present 
invention circumvents increased restriction of pesticide use which has made chemical control 
of insect infestation increasingly difficult. In this manner, the interests of the environment 
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and the economics of grape cultivation and wine making are all furthered by the present 
invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 

Figures 1 A and IB are a comparison of a double-stranded RNA (dsRNA) 
/ profile (Figure 1 A) of GLRaV-2 and its Northern hybridization analysis (Figure IB). In 

Figure 1 A: lane M, lambda Hind III DNA marker; and lane 1, dsRNA pattern in 1% agarose 
gel stained with ethidium bromide. Figure IB is a northern hybridization of isolated high 

1 0 molecular weight dsRNA of GLRaV-2 with a probe prepared with ^^P [a-dATP] labeled 
cDNA insert from GLRaV-2 specific cDNA clone TC-1 . Lane 1 , high molecular weight 
dsRNA of GLRaV-2. Lane 2, total RNA extracted from healthy grapevine. 

Figure 2 displays the genome organization of GLRaV-2 and its sequencing 
strategy. Boxes represent ORFs encoded by deduced amino acid sequences of GLRaV-2, 

1 5 numbered lines represent nucleotide coordinates, beginning from 5'-terminal of RNA in 

kilobases (kb). The lines below GLRaV-2 RNA genome represent the cDNA clones used to 
determine the nucleotide sequences. 

Figure 3A-3D are comparisons between ORFla/ORFlb of GLRaV-2 and 
BYV. Figure 3A-3D show the conserved domains of two papain-like proteases (P-PRO), 

20 methyltransferase (MT/MTR), helicase (HEL), and RNA-dependent RNA polymerase 
(RdRP), respectively. Exclamation marks indicate the predicted catalytic residues of the 
leader papain-like protease; slashes indicate the predicted cleavage sites. The conserved 
motifs of the MT, HEL, and RdRP domains are highlighted with overlines marked with 
respective letters. The alignment is constructed using the MegAlign program in DNASTAR. 

25 Figures 4A and 4B are alignments of the nucleotide (Figure 4A) and deduced 

amino acid (Figure 4B) sequences of ORFla/ORFlb overlapping region of GLRaV-2, BYV, 
BYSV, and CTV. Identical nucleotides and amino acids are shown in consensus. GLRaV-2 
putative + 1 frameshift site (TAGC) and its corresponding sites of BYV (TAGC) and BYSV 
(TAGC) and CTV (CGGC) at nucleotide and amino acid sequences are highlighted with 

30 underlines. 

Figure 5 is an alignment of the amino acid sequence of HSP70 protein of 
GLRaV-2 and BYV. The conserved motifs (A to H) are indicated with overlines and marked 
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with respective letters. The alignment was conducted with the MegAlign program of 
DNASTAR. 

Figure 6A is a comparison of the coat protein (CP) and coat protein duplicate 
(CPd) of GLRaV-2 with other closteroviruses. The amino acid sequence of the GLRaV-2 CP 
5 and CPd are aligned with the CP and CPd of BYV, BYSV, and CTV. The conserved amino 
acid residues are in bold and the consensus sequences are indicated. Sequence alignment and 
phylogenetic tree were constructed by Clustal Method in the MegAlign Program of 
DNASTAR. Figure 6B is a tentative phylogenetic tree of the CP and CPd of GLRaV-2 with 
BYV, BYSV, CTV, LIYV, LChV, and GLRaV-3. To facilitate the alignment, only the C- 

10 terminal 250 amino acids of CP and CPd of LIYV, LChV, and GLRaV-3 were used. The 

scale beneath the phylogenetic tree represents the distance between sequences. Units indicate 
the number of substitution events. 

Figure 7 is a comparison of the genome organization of GLRaV-2, BYV, 
BYSV, CTV, LIYV, LChV, and GLRaV-3. P-PRO, papain-like protease; MT/MTR, 

1 5 methyltransferase; HEL, helicase; RdRP, RNA-dependent RNA polymerase; HSP70, heat 
shock protein 70; CP, coat protein; CPd, coat protein duplicate. 

Figure 8 is a tentative phylogenetic tree showing the relationship of RdRP of 
GLRaV-2 with respect to BYV, BYSV, CTV, and LIYV. The phylogenetic tree was 
constructed using the Clustal method with the MegAlign program in DNASTAR. 

20 Figure 9 is an alignment of the amino acid sequence of HSP90 protein of 

GLRaV-2 with respect to other closteroviruses, BYS, BYSV, and CTV, The most conserved 
motifs (I to II) are indicated with the highlighted lines and marked with respective letters. 

Figure 10 is an aligimient of the nucleotide sequence of 3'-terminal 
untranslated region of GLRaV-2 with respect to the closteroviruses BYV (Agranovsky et al., 

25 "Beet Yellows Closterovirus: Complete Genome Structure and Identification of a Papain-like 
Thiol Protease," Virology 198:31 1-24 (1994), which is hereby incorporated by reference), 
BYSV (Karasev et al., Organization of the 3'-Terminal Half of Beet Yellow Stunt Virus 
Genome and Implications for the Evolution of Closteroviruses," Virology 221 :1 99-207 
(1996), which is hereby incorporated by reference), and CTV (Karasev et al., "Complete 

30 Sequence of the Citrus Tristeza Virus RNA Genome," Virologv 208:51 1-20 (1995), which is 
hereby incorporated by reference). The consensus sequences are shown, and the distance to 
the 3'-end is indicated. A complementary region capable of forming a "hair-pin" structure is 
underlined. 

Figures 1 1 A and 11 B are genetic maps of the transformation vectors 
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pGA482GG/EPT8CP-GLRaV.2 and pGA482G/EPT8CP-GLRaV-2, respectively. As shown 
in Figures 1 1 A and 1 IB, the plant expression cassette (EPT8CP-GLRaV-2), which consists 
of a double cauliflower mosaic virus (CaMV) 35S-enhancer, a CaMV 35S-promoter, an 
alfalfa mosaic virus (ALMV) RNA4 5' leader sequence, a coat protein gene of GLRaV-2 
5 (CP-GLRaV-2), and a CaMV 35S 3' untranslated region as a terminator, was cloned into the 
transformation vector by EcoR 1 restriction site. The CP of GLRaV-2 was cloned into the 
plant expression vector by Nco I restriction site. 

Figure 12 is a PCR analysis of DNA molecules extracted from the leaves of 
putative transgenic plants using both the CP gene of GLRaV-2 and NPT II gene specific 

1 0 primers. An ethidium bromide-stained gel shows a 720 bp amplified DNA fragment for NPT 
II gene, and a 653 bp DNA fragment for the entire coding sequence of the CP gene. Lane 1 , 
<D174 / Hae III DNA Marker; lanes 2-6, transgenic plants from different lines; lane 7, the cp 
gene of GLRaV-2 of positive control; and lane 8, NPT II gene of positive control. 

Figure 13 is a comparison of resistant (right side 3 plants) and susceptible (left 

1 5 side 3 plants) transgenic Nicotiana benthamiana plants. Plants are shown 48 days after 
inoculation with GLRaV-2. 

Figure 14 is a northern blot analysis of transgenic Nicotiana benthamiana 
plants. An aliquot of 10 g of total RNA extracted from putative transgenic plants was 
denatured and loaded onto 1% agarose gel containing formaldehyde. The separated RNAs 

20 were transferred to Gene Screen Plus membrane and hybridized with a ''^P-labeled DNA 
probe containing the 3' one third CP gene sequence. Lanes 1, 3, and 4 represent 
nontransformed control plants without RNA expression. The remaining lanes represent 
transgenic plants from different lines: lanes 2, 14-17, and 22-27 represent plants with high 
RNA expression level which are susceptible to GLRaV-2; all other lanes represent plants 

25 with undetectable or low RNA expression level which are resistant to GLRaV-2. 



DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to isolated DNA molecules encoding for the 
30 proteins or polypeptides of a grapevine leafroll virus (type 2). A substantial portion of the 
grapevine leafroll virus (type-2) ("GLRaV-2") genome has been sequenced. Within the 
genome are a plurality of open reading frames ("ORFs") and a 3' untranscribed region 
("UTR"), each containing DNA molecules in accordance with the present invention. The 
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DNA molecule which constitutes a substantial portion of the GLRaV-2 genome comprises 
the nucleotide sequence corresponding to SEQ. ID. No. 1 as follows: 

TAAACATTGC GAGAGAACCC CATTAGCGTC TCCGGGGTGA ACTTGGGAAG GTCTGCCGCC 60 

GCTCAGGTTA TTTATTTCGG CAGTTTCACG CAGCCCTTCG CGTTGTATCC GCGCCAAGAG 120 

AGCGCGATCG TAAAAACGCA ACTTCCACCG GTCAGTGTAG TGAAGGTGGA GTGCGTAGCT 180 

GCGGAGGTAG CTCCCGACAG GGGCGTGGTC GhChAGhPJ\C CTACGTCTGT TGGCGTTCCC 24 0 

CCGCAGCGCG GTGTGCTTTC TTTTCCGACG GTGGTTCGGA ACCGCGGCGA CGTGATAATC 300 

ACAGGGGTGG TGCATGAAGC CCTGAAGAAA ATTAAAGACG GGCTCTTACG CTTCCGCGTA 360 

GGCGGTGACA TGCGTTTTTC GAGATTTTTC TCATCGAACT ACGGCTGCAG ATTCGTCGCG 4 20 

AGCGTGCGTA CGAACACTAC AGTTTGGCTA AATTGCACGA AAGCGAGTGG TGAGAAATTC 4 80 

TCACTCGCCG CCGCGTGCAC GGCGGATTAC GTGGCGATGC TGCGTTATGT GTGTGGCGGG 54 0 

AAATTTCCAC TCGTCCTCAT GAGTAGAGTT ATTTACCCGG ATGGGCGCTG TTACTTGGCC 600 

CATATGAGGT ATTTGTGCGC CTTTTACTGT CGCCCGTTTA GAGAGTCGGA TTATGCCCTC 660 

GGT^ATGTGGC CTACGGTGGC GCGTCTCAGG GCATGCGTTG AGAAGAACTT CGGTGTCGAA 720 

GCTTGTGGCA TAGCTCTTCG TGGCTATTAC ACCTCTCGCA ATGTTTATCA CTGTGATTAT 7 80 

GACTCTGCTT ATGTAAAATA TTTTAGAAAC CTTTCCGGCC GCATTGGCGG TGGTTCGTTC 840 

GATCCGACAT CTTTAACCTC CGTAATT^CG GTGTU^GATTA GCGGTCTTCC AGGTGGTCTT 900 

CCTAAAAATA TAGCGTTTGG TGCCTTCCTG TGCGATATAC GTTACGTCGA ACCGGTAGAC 960 

TCGGGCGGCA TTCAATCGAG CGTTAAGACG AAACGTGAAG ATGCGCACCG AACCGTAGAG 1020 

GAACGGGCGG CCGGCGGATC CGTCGAGCAA CCGCGACAAA AGAGGATAGA TGAGAAAGGT 1080 

TGCGGCAGAG TTCCTAGTGG AGGTTTTTCG CATCTCCTGG TCGGCAACCT TAACGAAGTT 114 0 

AGGAGGAAGG TAGCTGCCGG ACTTCTACGC TTTCGCGTTG GCGGTGATAT GGATTTTCAT 1200 

CGCTCGTTCT CCACCCAAGC GGGCCACCGC TTGCTGGTGT GGCGCCGCTC GAGCCGGAGC 12 60 

GTGTGCCTTG AACTTTACTC ACCATCTAAA AACTTTTTGC GTTACGATGT CTTGCCCTGT 132 0 

TCTGGAGACT ATGCAGCGAT GTTTTCTTTC GCGGCGGGCG GCCGTTTCCC TTTAGTTTTG 1380 

ATGACTAGAA TTAGATACCC GAACGGGTTT TGTTACTTGG CTCACTGCCG GTACGCGTGC 14 4 0 

GCGTTTCTCT TAAGGGGTTT TGATCCGAAG CGTTTCGACA TCGGTGCTTT CCCCACCGCG 1500 

GCCAAGCTCA GAAACCGTAT GGTTTCGGAG CTTGGTGAAA GAAGTTTAGG TTTGAACTTG 1560 

TACGGCGCAT ATACGTCACG CGGCGTCTTT CACTGCGATT ATGACGCTAA GTTTATAAAG 162 0 

GATTTGCGTC TTATGTCAGC AGTTATAGCT GGAAAGGACG GGGTGGAAGA GGTGGTACCT 1680 
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TCTGACATAA CTCCTGCCAT GAAGCAGAAA ACGATCGAAG CCGTGTATGA TAGATTATAT 17 4 0 

GGCGGCACTG ACTCGTTGCT GAAACTGAGC ATCGAGAAAG ACTTAATCGA TTTCAAAAAT 18 00 

GACGTGCAGA GTTTGAAGAA AGATCGGCCG ATTGTCAAAG TGCCCTTTTA CATGTCGGAA 18 60 

GCAACACAGA ATTCGCTGAC GCGTTTCTAC CCTCAGTTCG AACTTAAGTT TTCGCACTCC 1920 

TCGCATTCAG ATCATCCCGC CGCCGCCGCT TCTAGACTGC TGGAAAATGA AACGTTAGTG 198 0 

CGCTTATGTG GTAATAGCGT TTCAGATATT GGAGGTTGTC CTCTTTTCCA TTTGCATTCC 204 0 

AAGACGCAAA GACGGGTTCA CGTATGTAGG CCTGTGTTGG ATGGCAAGGA TGCGCAGCGT 2100 

CGCGTGGTGC GTGATTTGCA GTATTCCAAC GTGCGTTTGG GAGACGATGA TAAAATTTTG 2160 

GAAGGGCCAC GCAATATCGA CATTTGCCAC TATCCTCTGG GCGCGTGTGA CCACGAAAGT 2220 

AGTGCTATGA TGATGGTGCA GGTGTATGAC GCGTCCCTTT ATGAGATATG TGGCGCCATG 2280 

ATCAAGAAGA AAAGCCGCAT AACGTACTTA ACCATGGTCA CGCCCGGCGA GTTTCTTGAC 2 34 0 

GGACGCGAAT GCGTCTACAT GGAGTCGTTA GACTGTGAGA TTGAAGTTGA TGTGCACGCG 2 4 00 

GACGTCGTAA TGTACAAATT CGGTAGTTCT TGCTATTCGC ACAAGCTTTC AATCATC7VAG 2 4 60 

GACATCATGA CCACTCCGTA CTTGACACTA GGTGGTTTTC TATTCAGCGT GGAGATGTAT 2520 

GAGGTGCGTA TGGGCGTGAA TTACTTCAAG ATTACGAAGT CCGAAGTATC GCCTAGCATT 2 580 

AGCTGCACCA AGCTCCTGAG ATACCGAAGA GCTAATAGTG ACGTGGTTAA AGTTAAACTT 2 64 0 

CCACGTTTCG ATAAGAAACG TCGCATGTGT CTGCCTGGGT ATGACACCAT ATACCTAGAT 27 00 

TCGAAGTTTG TGAGTCGCGT TTTCGATTAT GTCGTGTGTA ATTGCTCTGC CGTGAACTCA 2 7 60 

AAAACTTTCG AGTGGGTGTG GAGTTTCATT AAGTCTAGTA AGTCGAGGGT GATTATTAGC 2820 

GGTAAAATAA TTCACAAGGA TGTGAATTTG GACCTCAAGT ACGTCGAGAG TTTCGCCGCG 2880 

GTTATGTTGG CCTCTGGCGT GCGCAGTAGA CTAGCGTCCG AGTACCTTGC TAAGAACCTT 2 94 0 

AGTCATTTTT CGGGAGATTG CTCCTTTATT GAAGCCACGT CTTTCGTGTT GCGTGAGAAA 3000 

ATCAGAAACA TGACTCTGTU^ TTTTAACGAA AGACTTTTAC AGTTAGTGAA GCGCGTTGCC 3060 

TTTGCGACCT TGGACGTGAG TTTTCTAGAT TTAGATTCAA CTCTTGAATC AATAACTGAT 3120 

TTTGCCGAGT GTAAGGTAGC GATTGAACTC GACGAGTTGG GTTGCTTGAG AGCGGAGGCC 3180 

GAGAATGAAA AAATCAGGAA TCTGGCGGGA GATTCGATTG CGGCTAAACT CGCGAGCGAG 324 0 

ATAGTGGTCG ATATTGACTC TAAGCCTTCA CCGAAGCAGG TGGGT7VATTC GTCATCCGAA 3300 

AACGCCGATA AGCGGGAAGT TCAGAGGCCC GGTTTGCGTG GTGGTTCTAG AAACGGGGTT 3360 

GTTGGGGAGT TCCTTCACTT CGTCGTGGAT TCTGCCTTGC GTCTTTTCAA ATACGCGACG 34 20 

GATCAACAAC GGATCAAGTC TTACGTGCGT TTCTTGGACT CGGCGGTCTC ATTCTTGGAT 34 80 

TACAACTACG ATAATCTATC GTTTATACTG CGAGTGCTTT CGGAAGGTTA TTCGTGTATG 354 0 
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TTCGCGTTTT TGGCGAATCG CGGCGACTTA TCTAGTCGTG TCCGTAGCGC GGTGTGTGCT 3600 

GTGAAAGAAG TTGCTACCTC ATGCGCGAAC GCGAGCGTTT CTAAAGCCAA GGTTATGATT 3660 

ACCTTCGCAG CGGCCGTGTG TGCTATGATG TTTAATAGCT GCGGTTTTTC AGGCGACGGT 372 0 

CGGGAGTATA AATCGTATAT ACATCGTTAC ACGCAAGTAT TGTTTGACAC TATCTTTTTT 37 8 0 

GAGGACAGCA GTTACCTACC CATAGAAGTT CTGAGTTCGG CGATATGCGG TGCTATCGTC 384 0 

ACACTTTTCT CCTCGGGCTC GTCCATAAGT TTAT^CGCCT TCTTACTTCA AATTACCAAA 3900 

GGATTCTCCC TAGAGGTTGT CGTCCGGAAT GTTGTGCGAG TCACGCATGG TTTGAGCACC 3960 

ACAGCGACCG ACGGCGTCAT ACGTGGGGTT TTCTCCCAAA TTGTGTCTCA CTTACTTGTT 4 020 

GGAAATACGG GTAATGTGGC TTACCAGTCA GCTTTCATTG CCGGGGTGGT GCCTCTTTTA 4 08 0 

GTTTWW^GT GTGTGAGCTT AATCTTCATC TTGCGTGAAG ATACTTATTC CGGTTTTATT 414 0 

AAGCACGGAA TCAGTGAATT CTCTTTCCTT AGTAGTATTC TGAAGTTCTT GAAGGGTAAG 4 200 

CTTGTGGACG AGTTGAAATC GATTATTCAA GGGGTTTTTG ATTCCAACAA GCACGTGTTT 4 2 60 

AAAG/^GCTA CTCAGGAAGC GATTCGTACG ACGGTCATGC AAGTGCCTGT CGCTGTAGTG 4 32 0 

GATGCCCTTA AGAGCGCCGC GGGAAAAATT TATAACAATT TTACTAGTCG ACGTACCTTT 4 38 0 

GGTAAGGATG AAGGCTCCTC TAGCGACGGC GCATGTGAAG AGTATTTCTC ATGCGACGAA 444 0 

GGTGAAGGTC CGGGTCTGAA AGGGGGTTCC AGCTATGGCT TCTCAATTTT AGCGTTCTTT 4 500 

TCACGCATTA TGTGGGGAGC TCGTCGGCTT ATTGTTT^GG TGAAGCATGA GTGTTTTGGG 4 5 60 

A7VACTTTTTG AATTTCTATC GCTCAAGCTT CACGAATTCA GGACTCGCGT TTTTGGGAAG 4 620 

AATAGAACGG ACGTGGGAGT TTACGATTTT TTGCCCACGG GCATCGTGGA AACGCTCTCA 4 680 

TCGATAGAAG AGTGCGACCA AATTGAAGAA CTTCTCGGCG ACGACCTGAA AGGTGACAAG 4 74 0 

GATGCTTCGT TGACCGATAT GAATTACTTT GAGTTCTCAG AAGACTTCTT AGCCTCTATC 4 800 

GAGGAGCCGC CTTTCGCTGG ATTGCGAGGA GGTAGCAAGA ACATCGCGAT TTTGGCGATT 4 860 

TTGGAATACG CGCATAATTT GTTTCGCATT GTCGCAAGCA AGTGTTCGAA ACGACCTTTA 4 920 

TTTCTTGCTT TCGCCGAACT CTCAAGCGCC CTTATCGAGA AATTTAAGGA GGTTTTCCCT 4 980 

CGTAAGAGCC AGCTCGTCGC TATCGTGCGC GAGTATACTC AGAGATTCCT CCGAAGTCGC 504 0 

ATGCGTGCGT TGGGTTTGAA TAACGAGTTC GTGGTAAAAT CTTTCGCCGA TTTGCTACCC 5100 

GCATTAATGA AGCGGAAGGT TTCAGGTTCG TTCTTAGCTA GTGTTTATCG CCCACTTAGA 5160 

GGTTTCTCAT ATATGTGTGT TTCAGCGGAG CGACGTGAAA AGTTTTTTGC TCTCGTGTGT 5220 

TTAATCGGGT TAAGTCTCCC TTTCTTCGTG CGCATCGTAG GAGCGAAAGC GTGCGAAGAA 52 80 

CTCGTGTCCT CAGCGCGTCG CTTTTATGAG CGTATTAAAA TTTTTCTAAG GCAGAAGTAT 5 34 0 
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GTCTCTCTTT CTAATTTCTT TTGTCACTTG TTTAGCTCTG ACGTTGATGA CAGTTCCGCA 54 00 

TCTGCAGGGT TGAAAGGTGG TGCGTCGCGA ATGACGCTCT TCCACCTTCT GGTTCGCCTT 54 60 

GCTAGTGCCC TCCTATCGTT AGGGTGGGAA GGGTTAAAGC TACTCTTATC GCACCACAAC 5520 

TTGTTATTTT TGTGTTTTGC ATTGGTTGAC GATGTGAACG TCCTTATCAA AGTTCTTGGG 5580 

GGTCTTTCTT TCTTTGTGCA ACCAATCTTT TCCTTGTTTG CGGCGATGCT TCTACAACCG 5640 

GACAGGTTTG TGGAGTATTC CGAGAAACTT GTTACAGCGT TTGAATTTTT CTTAAAATGT 57 00 

TCGCCTCGCG CGCCTGCACT ACTCAAAGGG TTTTTTGAGT GCGTGGCGAA CAGCACTGTG 57 60 

TCAAAAACCG TTCGAAGACT TCTTCGCTGT TTCGTGAAGA TGCTCAAACT TCGA7VAAGGG 5820 

CGAGGGTTGC GTGCGGATGG TAGGGGTCTC CATCGGCAGA AAGCCGTACC CGTCATACCT 58 80 

TCTAATCGGG TCGTGACCGA CGGGGTTGAA AGACTTTCGG TAAAGATGCA AGGAGTTGAA 5 94 0 

GCGTTGCGTA CCGAATTGAG AATCTTAGAA GATTTAGATT CTGCCGTGAT CGAAAAACTC 6000 

AATAGACGCA GAAATCGTGA CACTAATGAC GACGAATTTA CGCGCCCTGC TCATGAGCAG 60 60 

ATGCAAGAAG TCACCACTTT CTGTTCGAAA GCCAACTCTG CTGGTTTGGC CCTGGAAAGG 6120 
GCAGTGCTTG TGGAAGACGC TATAAAGTCG GAGAAACTTT CTAAGACGGT TAATGAGATG 618 0 

GTGAGGAAAG GGAGTACCAC CAGCGAAGAA GTGGCCGTCG CTTTGTCGGA CGATGAAGCC 62 4 0 

GTGGAAGAAA TCTCTGTTGC TGACGAGCGA GACGATTCGC CTAAGACAGT CAGGATAAGC 6300 
GAATACCTAA ATAGGTTAAA CTCAAGCTTC GAATTCCCGA AGCCTATTGT TGTGGACGAC 63 60 

AACAAGGATA CCGGGGGTCT AACGAACGCC GTGAGGGAGT TTTATTATAT GCAAGAACTT 64 20 

GCTCTTTTCG AAATCCACAG CTW^CTGTGC ACCTACTACG ATCAACTGCG CATAGTCAAC 64 80 

TTCGATCGTT CCGTAGCACC ATGCAGCGT^ GATGCTCAGC TGTACGTACG GAAGAACGGC 654 0 

TCAACGATAG TGCAGGGTAA AGAGGTACGT TTGCACATTA AGGATTTCCA CGATCACGAT 6 600 

TTCCTGTTTG ACGGAAAAAT TTCTATTAAC AAGCGGCGGC GAGGCGGAAA TGTTTTATAT 6660 
CACGACT^CC TCGCGTTCTT GGCGAGTAAT TTGTTCTTAG CCGGCTACCC CTTTTCAAGG 6720 
AGCTTCGTCT TCACGAATTC GTCGGTCGAT ATTCTCCTCT ACG7VAGCTCC ACCCGGAGGT 6780 
GGTAAGACGA CGACGCTGAT TGACTCGTTC TTGAAGGTCT TCAAGAAAGG TGAGGTTTCC 684 0 

ACCATGATCT TAACCGCCAA CAAAAGTTCG CAGGTTGAGA TCCTAAAGAA AGTGGAGAAG 6900 
G7VAGTGTCTA ACATTGAATG CCAGAAACGT AAAGACAAAA GATCTCCGAA AAAGAGCATT 6960 
TACACCATCG ACGCTTATTT AATGCATCAC CGTGGTTGTG ATGCAGACGT TCTTTTCATC 7020 
GATGAGTGTT TCATGGTTCA TGCGGGTAGC GTACTAGCTT GCATTGAGTT CACGAGGTGT 7080 
CATAT^GTAA TGATCTTCGG GGATAGCCGG CAGATTCACT ACATTGAAAG GAACG7U\TTG 714 0 

GACAAGTGTT TGTATGGGGA TCTCGACAGG TTCGTGGACC TGCAGTGTCG GGTTTATGGT 7 200 
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AATATTTCGT ACCGTTGTCC ATGGGATGTG TGCGCTTGGT TAAGCACAGT GTATGGCAAC 7260 

CTAATCGCCA CCGTGAAGGG TGAAAGCGAA GGTAAGAGCA GCATGCGCAT TAACGAAATT 7320 

AATTCAGTCG ACGATTTAGT CCCCGACGTG GGTTCCACGT TTCTGTGTAT GCTTCAGTCG 7 380 

GAGT^GTTGG AAATCAGCAA GCACTTTATT CGCAAGGGTT TGACTAAACT TAACGTTCTA 74 4 0 

ACGGTGCATG AGGCGCAAGG TGAGACGTAT GCGCGTGTGA ACCTTGTGCG ACTTAAGTTT 7 500 

CAGGAGGATG AACCCTTTAA ATCTATCAGG CACAT7VACCG TCGCTCTTTC TCGTCACACC 7560 

GACAGCTTAA CTTATAACGT CTTAGCTGCT CGTCGAGGTG ACGCCACTTG CGATGCCATC 7 620 

CAGAAGGCTG CGGT^TTGGT GAACAAGTTT CGCGTTTTTC CTACATCTTT TGGTGGTAGT 7 680 

GTTATCAATC TCAACGTGAA GT^GGACGTG GAAGATAACA GTAGGTGCAA GGCTTCGTCG 77 4 0 

GCACCATTGA GCGTAATCAA CGACTTTTTG AACGAAGTTA ATCCCGGTAC TGCGGTGATT 7 800 

GATTTTGGTG ATTTGTCCGC GGACTTCAGT ACTGGGCCTT TTGAGTGCGG TGCCAGCGGT 7860 

ATTGTGGTGC GGGACAACAT CTCCTCCAGC AACATCACTG ATCACGATAA GCAGCGTGTT 7 920 

TAGCGTAGTT CGGTCGCAGG CGATTCCGCG TAGAAAACCT TCTCTACAAG AAAATTTGTA 7 980 

TTCGTTTGAA GCGCGGAATT ATAACTTCTC GACTTGCGAC CGTAACACAT CTGCTTCAAT 804 0 

GTTCGGAGAG GCTATGGCGA TGAACTGTCT TCGTCGTTGC TTCGACCTAG ATGCCTTTTC 8100 

GTCCCTGCGT GATGATGTGA TTAGTATCAC ACGTTCAGGC ATCGAACAAT GGCTGGAGAA 8160 

ACGTACTCCT AGTCAGATTA AAGCATTAAT GAAGGATGTT GAATCGCCTT TGGAAATTGA 822 0 

CGATGAAATT TGTCGTTTTA AGTTGATGGT GAAGCGTGAC GCTAAGGTGA AGTTAGACTC 828 0 

TTCTTGTTTA ACTPJ\ACACA GCGCCGCTCA AAATATCATG TTTCATCGCA AGAGCATTAA 834 0 

TGCTATCTTC TCTCCTATCT TTAATGAGGT GAAAAACCGA ATAATGTGCT GTCTTAAGCC 8 4 00 

TAACATAAAG TTTTTTACGG AGATGACTAA CAGGGATTTT GCTTCTGTTG TCAGCAACAT 84 60 

GCTTGGTGAC GACGATGTGT ACCATATAGG TGAAGTTGAT TTCTCAAAGT ACGAC7\AGTC 8520 

TCAAGATGCT TTCGTGAAGG CTTTTG7VAGA AGTAATGTAT AAGGAACTCG GTGTTGATGA 8580 

AGAGTTGCTG GCTATCTGGA TGTGCGGCGA GCGGTTATCG ATAGCTAACA CTCTCGATGG 8 64 0 

TCAGTTGTCC TTCACGATCG AGAATCAAAG GAAGTCGGGA GCTTCGAACA CTTGGATTGG 87 00 

TAACTCTCTC GTCACTTTGG GTATTTTAAG TCTTTACTAC GACGTTAGAA ATTTCGAGGC 87 60 

GTTGTACATC TCGGGCGATG ATTCTTTAAT TTTTTCTCGC AGCGAGATTT CG7\ATTATGC 8820 

CGACGACATA TGGACTGACA TGGGTTTTGA GACAAAATTT ATGTCCCCAA GTGTCCCGTA 8880 

CTTTTGTTCT AT^TTTGTTG TTATGTGTGG TCATAAGACG TTTTTTGTTC CCGACCCGTA 8 94 0 

CAAGCTTTTT GTCAAGTTGG GAGCAGTCAA AGAGGATGTT TCAATGGATT TCCTTTTCGA 9000 
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GACTTTTACC TCCTTTAAAG ACTTAACCTC CGATTTTAAC GACGAGCGCT TAATTCAAAA 9060 

GCTCGCTGAA CTTGTGGCTT TAAAATATGA GGTTCAAACC GGCAACACCA CCTTGGCGTT 9120 

AAGTGTGATA CATTGTTTGC GTTCGAATTT CCTCTCGTTT AGCAAGTTAT ATCCTCGCGT 918 0 

GAAGGGATGG CAGGTTTTTT ACACGTCGGT TAAGAAAGCG CTTCTCAAGA GTGGGTGTTC 92 4 0 

TCTCTTCGAC AGTTTCATGA CCCCTTTTGG TCAGGCTGTC ATGGTTTGGG ATGATGAGTA 9300 

GCGCTAACTT GTGCGCAGTT TCTTTGTTCG TGACATACAC CTTGTGTGTC ACCGTGCGTT 9360 

TATAATGAAT CAGGTTTTGC AGTTTGAATG TTTGTTTCTG CTGAATCTCG CGGTTTTTGC 94 20 

TGTGACTTTC ATTTTCATTC TTCTGGTCTT CCGCGTGATT AAGTCTTTTC GCCAGAAGGG 94 8 0 

TCACGAAGCA CCTGTTCCCG TTGTTCGTGG CGGGGGTTTT TCAACCGTAG TGTAGTCAAA 954 0 

AGACGCGCAT ATGGTAGTTT TCGGTTTGGA CTTTGGCACC ACATTCTCTA CGGTGTGTGT 9600 

GTACAAGGAT GGACGAGTTT TTTCATTCAA GCAGAATAAT TCGGCGTACA TCCCCACTTA 9660 

CCTCTATCTC TTCTCCGATT CTAACCACAT GACTTTTGGT TACGAGGCCG AATCACTGAT 97 20 

GAGTAATCTG AAAGTTAAAG GTTCGTTTTA TAGAGATTTA AAACGTTGGG TGGGTTGCGA 97 80 

TTCGAGTAAC CTCGACGCGT ACCTTGACCG TTTAAAACCT CATTACTCGG TCCGCTTGGT 98 4 0 

TAAGATCGGC TCTGGCTTGA ACGAAACTGT TTCAATTGGA AACTTCGGGG GCACTGTTAA 9900 

GTCTGAGGCT CATCTGCCAG GGTTGATAGC TCTCTTTATT AAGGCTGTCA TTAGTTGCGC 9 960 

GGAGGGCGCG TTTGCGTGCA CTTGCACCGG GGTTATTTGT TCAGTACCTG CCAATTATGA 10020 

TAGCGTTCAA AGGAATTTCA CTGATCAGTG TGTTTCACTC AGCGGTTATC AGTGCGTATA 10080 

TATGATCAAT GAACCTTCAG CGGCTGCGCT ATCTGCGTGT AATTCGATTG GAAAGAAGTC 1014 0 

CGCAAATTTG GCTGTTTACG ATTTCGGTGG TGGGACCTTC GACGTGTCTA TCATTTCATA 10200 

CCGCAACAAT ACTTTTGTTG TGCGAGCTTC TGGAGGCGAT CTAAATCTCG GTGGAAGGGA 102 60 

TGTTGATCGT GCGTTTCTCA CGCACCTCTT CTCTTTAACA TCGCTGGAAC CTGACCTCAC 10320 

TTTGGATATC TCGAATCTGA AAGAATCTTT ATCAAAAACG GACGCAGAGA TAGTTTACAC 10380 

TTTGAGAGGT GTCGATGGAA GAAAAGAAGA CGTTAGAGTA AACAAAAACA TTCTTACGTC 10440 

GGTGATGCTC CCCTACGTGA ACAGAACGCT TAAGATATTA GAGTCAACCT TAAAATCGTA 10500 

TGCTAAGAGT ATGAATGAGA GTGCGCGAGT TAAGTGCGAT TTAGTGCTGA TAGGAGGATC 105 60 

TTCATATCTT CCTGGCCTGG CAGACGTACT AACGAAGCAT CAGAGCGTTG ATCGTATCTT 10 620 
AAGAGTTTCG GATCCTCGGG CTGCCGTGGC CGTCGGTTGC GCATTATATT CTTCATGCCT 10680 
CTCAGGATCT GGGGGGTTGC TACTGATCGA CTGTGCAGCT CACACTGTCG CTATAGCGGA 10740 
CAGAAGTTGT CATCAAATCA TTTGCGCTCC AGCGGGGGCA CCGATCCCCT TTTCAGGAAG 10800 
CATGCCTTTG TACTTAGCCA GGGTCAACAA GAACTCGCAG CGTGAAGTCG CCGTGTTTGA 108 60 
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oil /"IrtVj i Vjk^^ 


i AAb AAG AG 


7\ T\ TV TV rn m m 

AAAGATCTGT 


GGAGCAAATA 


TAAGATTTTT 


10920 


1 o/i J. M 1 


O 1 Ort\-,OL30 X O 


Ai i UG i AGGG 


ACCCGTTACC 


TTCTATATGG 


ATTTCTCCAT 


10980 




Vj50/^0^^Vj 111 


U,Ai i CG 1 GG i 


GAGAGGTCCT 


GAGGGTAAGC 


AAGTGTCACT 


11040 






AU i i i 1 CG 1 G 


TGTGGCTCTC 


GGATCACGCA 


GTGTCCGAGA 


11100 


ML 1 KjK^M. 1 1 1 


Mo 111 AAA 1 A 


A T'A A A/^TirpTtT" 

A i AAAG TTTT 


TCTCGGTTTG 


CTTCTACATA 


GAAAGGCGGA 


11160 




L- 1 i 1 1 v^AG i A 


A ^ A *T>/^ TV TV 

AGGATGAAGC 


GATTCGATAC 


GCCGATTCAA 


TTGATATCGC 


11220 


(jUA 1 Cj 1 (jC i. A 


AAo^jAA I a i a 


AAAGTTACGC 


GGCCAGTGCC 


TTACCACCAG 


ACGAGGATGT 


11280 


ULa/iA i 1 AU 1 L- 


U I ooljAAALa 1 


rp rp rp 7v Ti tv tv 

G rGTTC/\AAA 


AGTTTTACGG 


GGAAGCAGAC 


TGGAAGAAAT 


11340 


ACC 1 i L. i A(j 


GACjGA 1 AGGA 


GCACACTCAA 


GTGAAATTAA 


AACTCTACCA 


GACATTCGAT 


11400 


1 I AL-LjLjL^ijL) 


JlAvjljbi IGIA 


AAGAAGTCCG 


AATTCGAATC 


AGCACTTCCT 


AATTCTTTTG 


11460 


AAC-ACjCjAA J. 1 


AGGAG i G i 1 G 


ATACTGAGCG 


7VACGGGAAGT 


GGGATGGAGC 


AAATTATGCG 


11520 


/-^ 7\ 7\rp7\ T\ rr> 

bAA 1 AALCjCd J 


GGAAGAAGCA 


GCATACGATC 


TTACGAATCC 


C7VAGGCTTAT 


AAATTCACTG 


11580 


CCGAGACATG 


TAGCCCGGAT 


GTAAAAGGTG 


AAGGACAAAA 


ATACTCTATG 


GAAGACGTGA 


11640 


TGAATTTCAT 


GCGTTTATCA 


AATCTGGATG 


TTAACGACAA 


GATGCTGACG 


GAACAGTGTT 


11700 


GGTCGCTGTC 


CAAi TGATGC 


GGTGAATTGA 


TCAACCCAGA 


CGACAAAGGG 


CGATTCGTGG 


11760 


L i C J CACC I 1 


i AAGGAGAGA 


GACACAGCTG 


ATGACACGGG 


TGCCGCCAAC 


GTGGAATGTC 


11820 


ryi ^ ^ ^ tv 

GCGTGGGCGA 


CTATCTAGTT 


TACGCTATGT 


CCCTGTTTGA 


GCAGAGGACC 


CAAAAATCGC 


11880 


AG I G i GGGAA 


O A T* 0 1^ rp rp 

GAi G 1 G 1 G TG 


TACGAAAAGT 


ACTGTGAATA 


CATCAGGACC 


TACTTAGGGA 


11940 


G i ALAGAGG J 


G i i G i i GAGA 


GCGCCGGACA 


GGATTCCGTT 


ACTTACGGGC 


ATCCTATACG 


12000 


Tv rp rp rprp rp rp tv tv 

Aili iiGi AA 


GGAA i AGAAG 


GTTTTCTACT 


CGTCATATAA 


GAGAAACGTC 


GATAATTTCA 


12060 


GAi I G r TGTT 


r" ^ 7\ ArprpTvm 

GGGGAAi TAT 


ATGCCTTTGA 


TATCTGACGT 


CTTTGTCTTC 


CAGTGGGTAA 


12120 


AA v_ ^ o o 


ooAIIjI i 


rp/^ rprp rprp rp 

G i GG i i i TTG 


AGTTAAGTGC 


AGCGGAACTA 


ACGCTGGAGG 


12180 


1 1 \— v--*^A^^A^ J 


oAo 111 oA 1 A 


GAI 1 G J GAAG 


TTGTGGTAGG 


TCATATCTTA 


AGATACGTAG 


12240 




A 1 *^AL3A 1 L^l_A 


taL-UAi GGAGG 


/""rprp TV TV TV Tv 

CGTTAGTIAGA 


CAAACTGG7\A 


GCGATACTGA 


12300 


AAAb 1 AGGAA 


i L^GGGG i G 1 A 


i GGAGAGCGC 


AACTATGGGT 


TGGTTTCTTT 


TGTTACTATG 


12360 


G X GAG ill GG 


1 AGGGG 1 GAA 


AGTAGAGTAG 


TGCAAAGACC 


AGGCGTATAC 


AAAACACCTG 


12420 


ACTCAGTGGG 


TGGATTTGAA 


ATAAACATGA 


AAGATGTTGA 


GAAATTCTTC 


GAT7\AACTTC 


12480 


AGAGAGAATT 


GCCTAATGTA 


TCTTTGCGGC 


GTCAGTTTAA 


CGGAGCTAGA 


GCGCATGAGG 


12540 


CTTTCAAAAT 


ATTTAAAAAC 


GGAAATATAA 


GTTTCAGACC 


TATATCGCGT 


TT7U\ACGTGC 


12600 


CTAGAGAGTT 


CTGGTATCTG 


AACATAGACT 


ACTTCAGGCA 


CGCGAATAGG 


TCCGGGTTAA 


12660 
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CCGAAGAAGA AATACTCATC CTAAACAACA TAAGCGTTGA TGTTAGGAAG TTATGCGCTG 12720 

AGAGAGCGTG CAATACCCTA CCTAGCGCGA AGCGCTTTAG TAAAT^TCAT AAGAGTAATA 12780 

TACAATCATC ACGCCAAGAG CGGAGGATTA AAGACCCATT GGTAGTCCTG AAAGACACTT 12840 

TATATGAGTT CCAACACAAG CGTGCCGGTT GGGGGTCTCG AAGCACTCGA GACCTCGGGA 12 900 

GTCGTGCTGA CCACGCGAAA GGAAGCGGTT GATAAGTTTT TTAATGAACT AAAAAACGAA 12 960 

AATTACTCAT CAGTTGACAG CAGCCGATTA AGCGATTCGG AAGTATW^GA AGTGTTAGAG 13020 

AAAAGTAAAG T^GTTTCAA AAGCGAACTG GCCTCCACTG ACGAGCACTT CGTCTACCAC 13080 

ATTATATTTT TCTTAATCCG ATGTGCTAAG ATATCGACAA GTGAAAAGGT GAAGTACGTT 1314 0 

GGTAGTCATA CGTACGTGGT CGACGG7UW\ ACGTACACCG TTCTTGACGC TTGGGTATTC 13200 

AACATGATGA AAAGTCTCAC GAAGTIAGTAC 7W\CGAGTGA ATGGTCTGCG TGCGTTCTGT 132 60 

TGCGCGTGCG AAGATCTATA TCT7VACCGTC GCACCAATAA TGTCAGAACG CTTTAAGACT 13320 

AAAGCCGTAG GGATG/W^GG TTTGCCTGTT GGAAAGGAAT ACTTAGGCGC CGACTTTCTT 13380 

TCGGGAACTA GCAAACTGAT GAGCGATCAC GACAGGGCGG TCTCCATCGT TGCAGCGAAA 13440 

AACGCTGTCG ATCGTAGCGC TTTCACGGGT GGGGAGAGAA AGATAGTTAG TTTGTATGAT 13500 

CTAGGGAGGT ACTAAGCACG GTGTGCTATA GTGCGTGCTA TAAT7VATAAA CACTAGTGCT 135 60 

TAAGTCGCGC AGAAGAAAAC GCTATGGAGT TGATGTCCGA CAGCAACCTT AGCAACCTGG 13 620 

TGATAACCGA CGCCTCTAGT CTAAATGGTG TCGACAAGAA GCTTTTATCT GCTGAAGTTG 13680 

AAAAAATGTT GGTGCAGAAA GGGGCTCCTA ACGAGGGTAT AGAAGTGGTG TTCGGTCTAC 13740 

TCCTTTACGC ACTCGCGGCA AGAACCACGT CTCCTAAGGT TCAGCGCGCA GATTCAGACG 138 00 

TTATATTTTC AAATAGTTTC GGAGAGAGGA ATGTGGTAGT AACAGAGGGT GACCTTAAGA 138 60 

AGGTACTCGA CGGGTGTGCG CCTCTCACTA GGTTCACTAA TAAACTTAGA ACGTTCGGTC 13920 

GTACTTTCAC TGAGGCTTAC GTTGACTTTT GTATCGCGTA TAAGCACAAA TTACCCCAAC 13 980 

TCAACGCCGC GGCGGAATTG GGGATTCCAG CTGAAGATTC GTACTTAGCT GCAGATTTTC 14 040 

TGGGTACTTG CCCGAAGCTC TCTGAATTAC AGCAAAGTAG GAAGATGTTC GCGAGTATGT 14100 

ACGCTCTAAA AACTGAAGGT GGAGTGGTAA ATACACCAGT GAGCAATCTG CGTCAGCTAG 14160 

GTAG7U\GGGA AGTTATGTAA TGGT^GATTA CGAAGAAAAA TCCGAATCGC TCATACTGCT 14 220 

ACGCACGAAT CTGAACACTA TGCTTTTAGT GGTCAAGTCC GATGCTAGTG TAGAGCTGCC 14 280 

TT^^CTACTA ATTTGCGGTT ACTTACGAGT GTCAGGACGT GGGGAGGTGA CGTGTTGCAA 14 340 

CCGTGAGGAA TTAACAAGAG ATTTTGAGGG CAATCATCAT ACGGTGATCC GTTCTAGAAT 14 4 00 

CATAC7VATAT GACAGCGAGT CTGCTTTTGA GGAATTCAAC AACTCTGATT GCGTAGTGAA 14 4 60 

GTTTTTCCTA GAGACTGGTA GTGTCTTTTG GTTTTTCCTT CGAAGTGAAA CCAAAGGTAG 14 520 
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AGCGGTGCGA 


CATTTGCGCA 


CCTTCTTCGA 


AGCTAAC7VAT 


TTCTTCTTTG 


GATCGCATTG 


14580 


CGGTACCATG 


GAGTATTGTT 


TGAAGCAGGT 


ACTAACTGAA 


ACTGAATCTA 


T7VATCGATTC 


14640 


TTTTTGCGAA 


GAAAGAAATC 


GTTAAGATGA 


GGGTTATAGT 


GTCTCCTTAT 


GAAGCTGAAG 


14700 


ACATTCTGAA 


AAGATCGACT 


GACATGTTAC 


GAAACATAGA 


CAGTGGGGTC 


TTGAGCACTA 


14760 


AAGAATGTAT 


CAlAGGCATTC 


TCGACGATAA 


CGCGAGACCT 


ACATTGTGCG 


AAGGCTTCCT 


14820 


ACCAGTGGGG 


TGTTGACACT 


GGGTTATATC 


AGCGTT^TTG 


CGCTGAAAAA 


CGTTTAATTG 


14880 


ACACGGTGGA 


GTCAAACATA 


CGGTTGGCTC 


AACCTCTCGT 


GCGTGAAAAA 


GTGGCGGTTC 


14940 


ATTTTTGTAA 


GGATGAACCA 


AAAGAGCTAG 


TAG CAT T CAT 


CACGCGAAAG 


TACGTGGAAC 


15000 


TCACGGGCGT 


G G GAG T GAGA 


GAAGCGGTGA 


AGAGGGAAAT 


GCGCTCTCTT 


ACCAAAACAG 


15060 


TTTTAAATAA 


AATGTCTTTG 


GAAATGGCGT 


TTTACATGTC 


ACCACGAGCG 


TGGAAAAACG 


15120 


CTGAATGGTT 


AGAACTAAAA 


TTTTCACCTG 


TGAAAATCTT 


TAGAGATCTG 


CTATTAGACG 


15180 


TGGAAACGCT 


CAACGAATTG 


TGCGCCGAAG 


ATGATGTTCA 


CGTCGACAAA 


GTAAATGAGA 


15240 


ATGGGGACGA 


AAATCACGAC 


CTCGAACTCC 


AAGACGAATG 


TTAAACATTG 


GTTAAGTTTA 


15300 


ACGAAAATGA 












15360 


AAGCTGTTAC 


TGAGTAATTA 


AACCAACAAG 


TGTTGGTGTA 


ATGTGTATGT 


TGATGTAGAG 


15420 


AAAAATCCGT 


TTGTAGAACG 


GTGTTTTTCT 


CTTCTTTATT 


TTTAAAAAAA 


AAATAAAAAA 


15480 


AAAAAAAAAA 


AAGCGGCCGC 










15500 



Another DNA molecule of the present invention (GLRaV-2 ORFla) includes 
nucleotides 4-7923 of SEQ. ID. No. 1 and is believed to code for a large, grapevine leafroll 
virus polyprotein containing the conserved domains characteristic of two papain-like 
proteases, a methyl transferase, and a helicase. This DNA molecule comprises the nucleotide 
sequence corresponding to SEQ. ID. No. 2 as follows: 



ACATTGCGAG 


AGAACCCCAT 


TAGCGTCTCC 


GGGGTGAACT 


TGGGAAGGTC 


TGCCGCCGCT 


60 


CAGGTTATTT 


ATTTCGGCAG 


TTTCACGCAG 


CCCTTCGCGT 


TGTATCCGCG 


CCAAGAGAGC 


120 


GCGATCGTAA 


AAACGCAACT 


TCCACCGGTC 


AGTGTAGTGA 


AGGTGGAGTG 


CGTAGCTGCG 


180 


GAGGTAGCTC 


CCGACAGGGG 


CGTGGTCGAC 


AAGAAACCTA 


CGTCTGTTGG 


CGTTCCCCCG 


240 


CAGCGCGGTG 


TGCTTTCTTT 


TCCGACGGTG 


GTTCGGAACC 


GCGGCGACGT 


GATAATCACA 


300 


GGGGTGGTGC 


ATGAAGCCCT 


GAAGAAAATT 


AAAGACGGGC 


TCTTACGCTT 


CCGCGTAGGC 


360 


GGTGACATGC 


GTTTTTCGAG 


ATTTTTCTCA 


TCGAACTACG 


GCTGCAGATT 


CGTCGCGAGC 


420 


GTGCGTACGA 


ACACTACAGT 


TTGGCTAAAT 


TGCACGAAAG 


CGAGTGGTGA 


GAAATTCTCA 


480 
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CTCGCCGCCG CGTGCACGGC GGATTACGTG GCGATGCTGC GTTATGTGTG TGGCGGGAAA 54 0 

TTTCCACTCG TCCTCATGAG TAGAGTTATT TACCCGGATG GGCGCTGTTA CTTGGCCCAT 600 

ATGAGGTATT TGTGCGCCTT TTACTGTCGC CCGTTTAGAG AGTCGGATTA TGCCCTCGGA 660 

ATGTGGCCTA CGGTGGCGCG TCTCAGGGCA TGCGTTGAGA AGAACTTCGG TGTCGAAGCT 720 

TGTGGCATAG CTCTTCGTGG CTATTACACC TCTCGCAATG TTTATCACTG TGATTATGAC 7 80 

TCTGCTTATG TAAAATATTT TAGAAACCTT TCCGGCCGCA TTGGCGGTGG TTCGTTCGAT 84 0 

CCGACATCTT TAACCTCCGT AATAACGGTG AAGATTAGCG GTCTTCCAGG TGGTCTTCCT 900 

AAAAATATAG CGTTTGGTGC CTTCCTGTGC GATATACGTT ACGTCGAACC GGTAGACTCG 960 

GGCGGCATTC AATCGAGCGT TAAGACGAAA CGTGAAGATG CGCACCGAAC CGTAGAGGAA 1020 

CGGGCGGCCG GCGGATCCGT CGAGCT^CCG CGACAAAAGA GGATAGATGA GAAAGGTTGC 1080 

GGCAGAGTTC CTAGTGGAGG TTTTTCGCAT CTCCTGGTCG GCAACCTTAA CGAAGTTAGG 114 0 

AGGAAGGTAG CTGCCGGACT TCTACGCTTT CGCGTTGGCG GTGATATGGA TTTTCATCGC 12 00 

TCGTTCTCCA CCCAAGCGGG CCACCGCTTG CTGGTGTGGC GCCGCTCGAG CCGGAGCGTG 12 60 

TGCCTTGAAC TTTACTCACC ATCTAAAAAC TTTTTGCGTT ACGATGTCTT GCCCTGTTCT 1320 

GGAGACTATG CAGCGATGTT TTCTTTCGCG GCGGGCGGCC GTTTCCCTTT AGTTTTGATG 138 0 

ACTAGAATTA GATACCCGAA CGGGTTTTGT TACTTGGCTC ACTGCCGGTA CGCGTGCGCG 14 4 0 

TTTCTCTTAA GGGGTTTTGA TCCGAAGCGT TTCGACATCG GTGCTTTCCC CACCGCGGCC 1500 

AAGCTCAGAA ACCGTATGGT TTCGGAGCTT GGTGAAAGT^ GTTTAGGTTT G/^CTTGTAC 15 60 

GGCGCATATA CGTCACGCGG CGTCTTTCAC TGCGATTATG ACGCTAAGTT TATAAAGGAT 1620 

TTGCGTCTTA TGTCAGCAGT TATAGCTGGA AAGGACGGGG TGG7VAGAGGT GGTACCTTCT 168 0 

GACATAACTC CTGCCATGAA GCAGAAAACG ATCGT^GCCG TGTATGATAG ATTATATGGC 174 0 

GGCACTGACT CGTTGCTGAA ACTGAGCATC GAGAAAGACT TAATCGATTT CAAAAATGAC 18 00 

GTGCAGAGTT TGAAGAAAGA TCGGCCGATT GTCAAAGTGC CCTTTTACAT GTCGGAAGCA 18 60 

ACACAGAATT CGCTGACGCG TTTCTACCCT CAGTTCGAAC TTAAGTTTTC GCACTCCTCG 1920 

CATTCAGATC ATCCCGCCGC CGCCGCTTCT AGACTGCTGG AAAATGAAAC GTTAGTGCGC 1980 

TTATGTGGTA ATAGCGTTTC AGATATTGGA GGTTGTCCTC TTTTCCATTT GCATTCCAAG 2 04 0 

ACGCAAAGAC GGGTTCACGT ATGTAGGCCT GTGTTGGATG GCAAGGATGC GCAGCGTCGC 2100 

GTGGTGCGTG ATTTGCAGTA TTCCAACGTG CGTTTGGGAG ACGATGATAA AATTTTGGAA 2160 

GGGCCACGCA ATATCGACAT TTGCCACTAT CCTCTGGGCG CGT.GTGACCA CGAAAGTAGT 2220 

GCTATGATGA TGGTGCAGGT GTATGACGCG TCCCTTTATG AGATATGTGG CGCCATGATC 2280 
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AAGAAGAAAA GCCGCATAAC GTACTTAACC ATGGTCACGC CCGGCGAGTT TCTTGACGGA 234 0 

CGCGAATGCG TCTACATGGA GTCGTTAGAC TGTGAGATTG AAGTTGATGT GCACGCGGAC 24 00 

GTCGTAATGT ACAAATTCGG TAGTTCTTGC TATTCGCACA AGCTTTCAAT CATCAAGGAC 24 60 

ATCATGACCA CTCCGTACTT GACACTAGGT GGTTTTCTAT TCAGCGTGGA GATGTATGAG 2520 

GTGCGTATGG GCGTGAATTA CTTCAAGATT ACGAAGTCCG AAGTATCGCC TAGCATTAGC 2580 

TGCACCAAGC TCCTGAGATA CCGAAGAGCT AATAGTGACG TGGTTAAAGT TAAACTTCCA 264 0 

CGTTTCGATA AGAAACGTCG CATGTGTCTG CCTGGGTATG ACACCATATA CCTAGATTCG 2700 

AAGTTTGTGA GTCGCGTTTT CGATTATGTC GTGTGTAATT GCTCTGCCGT GAACTCAAAA 27 60 

ACTTTCGAGT GGGTGTGGAG TTTCATTAAG TCTAGTAAGT CGAGGGTGAT TATTAGCGGT 2820 

AAAATAATTC ACAAGGATGT GAATTTGGAC CTCAAGTACG TCGAGAGTTT CGCCGCGGTT 2880 

ATGTTGGCCT CTGGCGTGCG CAGTAGACTA GCGTCCGAGT ACCTTGCTAA GAACCTTAGT 294 0 

CATTTTTCGG GAGATTGCTC CTTTATTGAA GCCACGTCTT TCGTGTTGCG TGAGAAAATC 3000 

AGAAACATGA CTCTGAATTT TAACGAAAGA CTTTTACAGT TAGTGAAGCG CGTTGCCTTT 3060 

GCGACCTTGG ACGTGAGTTT TCTAGATTTA GATTCAACTC TTGAATCAAT AACTGATTTT 3120 

GCCGAGTGTA AGGTAGCGAT TGAACTCGAC GAGTTGGGTT GCTTGAGAGC GGAGGCCGAG 3180 

AATGAAAAAA TCAGGAATCT GGCGGGAGAT TCGATTGCGG CTAAACTCGC GAGCGAGATA 3240 

GTGGTCGATA TTGACTCTAA GCCTTCACCG AAGCAGGTGG GTAATTCGTC ATCCGAAAAC 3300 

GCCGATAAGC GGGAAGTTCA GAGGCCCGGT TTGCGTGGTG GTTCTAGAAA CGGGGTTGTT 3360 

GGGGAGTTCC TTCACTTCGT CGTGGATTCT GCCTTGCGTC TTTTCAAATA CGCGACGGAT 34 20 

CAACAACGGA TCAAGTCTTA CGTGCGTTTC TTGGACTCGG CGGTCTCATT CTTGGATTAC 34 80 

AACTACGATA ATCTATCGTT TATACTGCGA GTGCTTTCGG AAGGTTATTC GTGTATGTTC 354 0 

GCGTTTTTGG CGAATCGCGG CGACTTATCT AGTCGTGTCC GTAGCGCGGT GTGTGCTGTG 3600 

AAAGAAGTTG CTACCTCATG CGCGAACGCG AGCGTTTCTA AAGCCAAGGT TATGATTACC 3660 

TTCGCAGCGG CCGTGTGTGC TATGATGTTT AATAGCTGCG GTTTTTCAGG CGACGGTCGG 3720 

GAGTATAAAT CGTATATACA TCGTTACACG CAAGTATTGT TTGACACTAT CTTTTTTGAG 3780 

GACAGCAGTT ACCTACCCAT AGAAGTTCTG AGTTCGGCGA TATGCGGTGC TATCGTCACA 384 0 

CTTTTCTCCT CGGGCTCGTC CATAAGTTTA AACGCCTTCT TACTTCAAAT TACCAAAGGA 3900 

TTCTCCCTAG AGGTTGTCGT CCGGAATGTT GTGCGAGTCA CGCATGGTTT GAGCACCACA 3960 

GCGACCGACG GCGTCATACG TGGGGTTTTC TCCCAAATTG TGTCTCACTT ACTTGTTGGA 4 020 

AATACGGGTA ATGTGGCTTA CCAGTCAGCT TTCATTGCCG GGGTGGTGCC TCTTTTAGTT 4 080 

AAAAAGTGTG TGAGCTTAAT CTTCATCTTG CGTGAAGATA CTTATTCCGG TTTTATTAAG 414 0 



wo 98/53055 



-23- 



PCT/US98/10313 



CACGGAATCA GTGAATTCTC TTTCCTTAGT AGTATTCTGA AGTTCTTGAA GGGTT^GCTT 4 200 

GTGGACGAGT TGAAATCGAT TATTCAAGGG GTTTTTGATT CCAACAAGCA CGTGTTTAAA 4 2 60 

GT^GCTACTC AGGAAGCGAT TCGTACGACG GTCATGCAAG TGCCTGTCGC TGTAGTGGAT 4 320 

GCCCTTAAGA GCGCCGCGGG AAAAATTTAT AACAATTTTA CTAGTCGACG TACCTTTGGT 4 380 

AAGGATGAAG GCTCCTCTAG CGACGGCGCA TGTGAAGAGT ATTTCTCATG CGACGAAGGT 4 4 40 

GAAGGTCCGG GTCTGAAAGG GGGTTCCAGC TATGGCTTCT CAATTTTAGC GTTCTTTTCA 4 500 

CGCATTATGT GGGGAGCTCG TCGGCTTATT GTTAAGGTGA AGCATGAGTG TTTTGGGAAA 4 5 60 

CTTTTTGAAT TTCTATCGCT CAAGCTTCAC GAATTCAGGA CTCGCGTTTT TGGGAAGAAT 4 620 

AGAACGGACG TGGGAGTTTA CGATTTTTTG CCCACGGGCA TCGTGGAAAC GCTCTCATCG 4 680 

ATAGAAGAGT GCGACCAAAT TGAAGAACTT CTCGGCGACG ACCTGAAAGG TGACAAGGAT 4740 

GCTTCGTTGA CCGATATG7y\ TTACTTTGAG TTCTCAGAAG ACTTCTTAGC CTCTATCGAG 4 800 

GAGCCGCCTT TCGCTGGATT GCGAGGAGGT AGCAAGAACA TCGCGATTTT GGCGATTTTG 4 8 60 

GAATACGCGC ATAATTTGTT TCGCATTGTC GCAAGCAAGT GTTCGAAACG ACCTTTATTT 4 920 

CTTGCTTTCG CCGAACTCTC AAGCGCCCTT ATCGAGAAAT TTAAGGAGGT TTTCCCTCGT 4 98 0 

AAGAGCCAGC TCGTCGCTAT CGTGCGCGAG TATACTCAGA GATTCCTCCG AAGTCGCATG 504 0 

CGTGCGTTGG GTTTGAATAA CGAGTTCGTG GTAAAATCTT TCGCCGATTT GCTACCCGCA 5100 

TTAATGAAGC GGAAGGTTTC AGGTTCGTTC TTAGCTAGTG TTTATCGCCC ACTTAGAGGT 5160 

TTCTCATATA TGTGTGTTTC AGCGGAGCGA CGTGAAAAGT TTTTTGCTCT CGTGTGTTTA 522 0 

ATCGGGTTAA GTCTCCCTTT CTTCGTGCGC ATCGTAGGAG CGAAAGCGTG CGAAGAACTC 52 80 

GTGTCCTCAG CGCGTCGCTT TTATGAGCGT ATTAAAATTT TTCTAAGGCA GAAGTATGTC 534 0 

TCTCTTTCTA ATTTCTTTTG TCACTTGTTT AGCTCTGACG TTGATGACAG TTCCGCATCT 5 4 00 

GCAGGGTTGA AAGGTGGTGC GTCGCGAATG ACGCTCTTCC ACCTTCTGGT TCGCCTTGCT 54 60 

AGTGCCCTCC TATCGTTAGG GTGGGAAGGG TTAAAGCTAC TCTTATCGCA CCACT^CTTG 5520 
TTATTTTTGT GTTTTGCATT GGTTGACGAT GTGAACGTCC TTATCAAAGT TCTTGGGGGT 5580 
CTTTCTTTCT TTGTGCAACC AATCTTTTCC TTGTTTGCGG CGATGCTTCT AC7VACCGGAC 564 0 

AGGTTTGTGG AGTATTCCGA GTW^CTTGTT ACAGCGTTTG AATTTTTCTT AAAATGTTCG 5700 
CCTCGCGCGC CTGCACTACT CAAAGGGTTT TTTGAGTGCG TGGCGAACAG CACTGTGTCA 57 60 

AAAACCGTTC GAAGACTTCT TCGCTGTTTC GTGAAGATGC TCAAACTTCG AAAAGGGCGA 582 0 
GGGTTGCGTG CGGATGGTAG GGGTCTCCAT CGGCAGAAAG CCGTACCCGT CATACCTTCT 5880 
AATCGGGTCG TGACCGACGG GGTTGAAAGA CTTTCGGTAA AGATGCAAGG AGTTGAAGCG 594 0 



wo 98/53055 



"24- 



PCT/US98/10313 



TTGCGTACCG AATTGAGAAT CTTAGAAGAT TTAGATTCTG CCGTGATCGA AAAACTCAAT 6000 

AGACGCAGAA ATCGTGACAC TAATGACGAC GAATTTACGC GCCCTGCTCA TGAGCAGATG 6060 

CAAGAAGTCA CCACTTTCTG TTCGAAAGCC AACTCTGCTG GTTTGGCCCT GGAAAGGGCA 6120 

GTGCTTGTGG AAGACGCTAT AAAGTCGGAG AAACTTTCTA AGACGGTTAA TGAGATGGTG 618 0 

AGGAAAGGGA GTACCACCAG CGAAGAAGTG GCCGTCGCTT TGTCGGACGA TGAAGCCGTG 624 0 

GAAGAAATCT CTGTTGCTGA CGAGCGAGAC GATTCGCCTA AGACAGTCAG GATAAGCGAA 6300 

TACCTAAATA GGTTAAACTC AAGCTTCGAA TTCCCGAAGC CTATTGTTGT GGACGACAAC 6360 

AAGGATACCG GGGGTCTAAC GAACGCCGTG AGGGAGTTTT ATTATATGCA AGAACTTGCT 6420 

CTTTTCGAAA TCCACAGCAA ACTGTGCACC TACTACGATC AACTGCGCAT AGTCAACTTC 64 80 

GATCGTTCCG TAGCACCATG CAGCGAAGAT GCTCAGCTGT ACGTACGGAA GAACGGCTCA 654 0 

ACGATAGTGC AGGGTAAAGA GGTACGTTTG CACATTAAGG ATTTCCACGA TCACGATTTC 6600 

CTGTTTGACG GAAAAATTTC TATTAACAAG CGGCGGCGAG GCGGAAATGT TTTATATCAC 6660 

GACAACCTCG CGTTCTTGGC GAGTAATTTG TTCTTAGCCG GCTACCCCTT TTCAAGGAGC 6720 

TTCGTCTTCA CGAATTCGTC GGTCGATATT CTCCTCTACG AAGCTCCACC CGGAGGTGGT 67 8 0 

AAGACGACGA CGCTGATTGA CTCGTTCTTG AAGGTCTTCA AGAAAGGTGA GGTTTCCACC 68 4 0 

ATGATCTTAA CCGCCAACAA AAGTTCGCAG GTTGAGATCC TAAAGAAAGT GGAGAAGG/^^ 6900 

GTGTCTAACA TTGAATGCCA GAAACGTAAA GACAAAAGAT CTCCGAAAAA GAGCATTTAC 6960 

ACCATCGACG CTTATTTAAT GCATCACCGT GGTTGTGATG CAGACGTTCT TTTCATCGAT 7 020 

GAGTGTTTCA TGGTTCATGC GGGTAGCGTA CTAGCTTGCA TTGAGTTCAC GAGGTGTCAT 7080 

AAAGTAATGA TCTTCGGGGA TAGCCGGCAG ATTCACTACA TTGATVAGGAA CGAATTGGAC 7140 

AAGTGTTTGT ATGGGGATCT CGACAGGTTC GTGGACCTGC AGTGTCGGGT TTATGGTAAT 7200 

ATTTCGTACC GTTGTCCATG GGATGTGTGC GCTTGGTTAA GCACAGTGTA TGGC7VACCTA 7 2 60 

ATCGCCACCG TG/^GGGTGA AAGCGAAGGT AAGAGCAGCA TGCGCATTAA CGAAATTAAT 7320 

TCAGTCGACG ATTTAGTCCC CGACGTGGGT TCCACGTTTC TGTGTATGCT TCAGTCGGAG 7 380 

AAGTTGGAAA TCAGCAAGCA CTTTATTCGC AAGGGTTTGA CTAAACTTAA CGTTCTAACG 74 4 0 

GTGCATGAGG CGCAAGGTGA GACGTATGCG CGTGTGAACC TTGTGCGACT TT^GTTTCAG 7500 

GAGGATGAAC CCTTTAAATC TATCAGGCAC ATAACCGTCG CTCTTTCTCG TCACACCGAC 7 560 

AGCTTAACTT ATAACGTCTT AGCTGCTCGT CGAGGTGACG CCACTTGCGA TGCCATCCAG 7 620 
AAGGCTGCGG AATTGGTGAA CAAGTTTCGC GTTTTTCCTA CATCTTTTGG TGGTAGTGTT 7680 
ATCAATCTCA ACGTGAAGAA GGACGTGGAA GATAACAGTA GGTGC7VAGGC TTCGTCGGCA 77 4 0 
CCATTGAGCG TAATCAACGA CTTTTTGAAC GAAGTTAATC CCGGTACTGC GGTGATTGAT 7 800 



wo 98/53055 



-25- 



PCTAJS98/10313 



TTTGGTGATT TGTCCGCGGA CTTCAGTACT GGGCCTTTTG AGTGCGGTGC CAGCGGTATT 7 8 60 

GTGGTGCGGG ACAACATCTC CTCCAGC7VAC ATCACTGATC ACGATT^GCA GCGTGTTTAG 7 920 



The large polyprotein (papain-Iike proteases, methyllransferase, and helicase) has an 
amino acid sequence corresponding to SEQ. ID. No. 3 as follows: 



Thr Leu Arg Glu Asn Pro He Ser Val Ser Gly Val Asn Leu Gly Arg 
15 10 15 

Ser Ala Ala Ala Gin Val He Tyr Phe Gly Ser Phe Thr Gin Pro Phe 
20 25 30 

Ala Leu Tyr Pro Arg Gin Glu Ser Ala He Val Lys Thr Gin Leu Pro 
35 40 45 

Pro Val Ser Val Val Lys Val Glu Cys Val Ala Ala Glu Val Ala Pro 
50 55 60 

Asp Arg Gly Val Val Asp Lys Lys Pro Thr Ser Val Gly Val Pro Pro 

65 70 75 80 

Gin Arg Gly Val Leu Ser Phe Pro Thr Val Val Arg Asn Arg Gly Asp 
85 90 95 



Val He He Thr Gly Val Val His 
100 

Gly Leu Leu Arg Phe Arg Val Gly 

115 120 

Phe Ser Ser Asn Tyr Gly Cys Arg 
130 135 

Thr Thr Val Trp Leu Asn Cys Thr 
145 150 

Leu Ala Ala Ala Cys Thr Ala Asp 
166 

Cys Gly Gly Lys Phe Pro Leu Val 
180 



Glu Ala Leu Lys Lys He Lys Asp 
105 110 

Gly Asp Met Arg Phe Ser Arg Phe 
125 

Phe Val Ala Ser Val Arg Thr Asn 
140 

Lys Ala Ser Gly Glu Lys Phe Ser 

155 160 

Tyr Val Ala Met Leu Arg Tyr Val 
170 175 

Leu Met Ser Arg Val He Tyr Pro 
185 190 



Asp Gly Arg Cys Tyr 
195 

Cys Arg Pro Phe Arg 
210 

Val Ala Arg Leu Arg 
225 

Cys Gly He Ala Leu 
245 



Leu Ala His Met Arg 
200 

Glu Ser Asp Tyr Ala 
215 

Ala Cys Val Glu Lys 
230 

Arg Gly Tyr Tyr Thr 
250 



Tyr Leu Cys Ala Phe Tyr 
205 

Leu Gly Met Trp Pro Thr 
220 

Asn Phe Gly Val Glu Ala 

235 240 

Ser Arg Asn Val Tyr His 
255 



Cys Asp Tyr Asp Ser Ala Tyr Val Lys Tyr Phe Arg Asn Leu Ser Gly 
260 265 270 
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Arg lie Gly Gly 
275 

Thr Val Lys lie 
290 

Phe Gly Ala Phe 
305 

Gly Gly He Gin 



Thr Val Glu Glu 
340 

Lys Arg He Asp 
355 

Ser His Leu Leu 
370 

Ala Gly Leu Leu 
385 

Ser Phe Ser Thr 



Ser Arg Ser Val 
420 

Arg Tyr Asp Val 
435 

Phe Ala Ala Gly 
450 

Tyr Pro Asn Gly 
4 65 

Phe Leu Leu Arg 



Pro Thr Ala Ala 
500 

Arg Ser Leu Gly 
515 

Phe His Cys Asp 
530 

Ser Ala Val He 
545 

Asp He Thr Pro 



Arg Leu Tyr Gly 
580 



Gly Ser Phe Asp 
280 

Ser Gly Leu Pro 
295 

Leu Cys Asp He 
310 

Ser Ser Val Lys 
325 

Arg Ala Ala Gly 



Glu Lys Gly Cys 
360 

Val Gly Asn Leu 
375 

Arg Phe Arg Val 
390 

Gin Ala Gly His 
405 

Cys Leu Glu Leu 



Leu Pro Cys Ser 
440 

Gly Arg Phe Pro 
455 

Phe Cys Tyr Leu 
470 

Gly Phe Asp Pro 
485 

Lys Leu Arg Asn 



Leu Asn Leu Tyr 
520 

Tyr Asp Ala Lys 
535 

Ala Gly Lys Asp 
550 

Ala Met Lys Gin 
565 

Gly Thr Asp Ser 



Pro Thr Ser Leu 



Gly Gly Leu Pro 
300 

Arg Tyr Val Glu 
315 

Thr Lys Arg Glu 
330 

Gly Ser Val Glu 
345 

Gly Arg Val Pro 



Asn Glu Val Arg 
380 

Gly Gly Asp Met 
395 

Arg Leu Leu Val 
410 

Tyr Ser Pro Ser 
425 

Gly Asp Tyr Ala 



Leu Val Leu Met 
4 60 

Ala His Cys Arg 
475 

Lys Arg Phe Asp 
490 

Arg Met Val Ser 
505 

Gly Ala Tyr Thr 



Phe He Lys Asp 
540 

Gly Val Glu Glu 
555 

Lys Thr He Glu 
570 

Leu Leu Lys Leu 
585 



Thr Ser Val He 
285 

Lys Asn He Ala 



Pro Val Asp Ser 
320 

Asp Ala His Arg 
335 

Gin Pro Arg Gin 

350 

Ser Gly Gly Phe 
365 

Arg Lys Val Ala 



Asp Phe His Arg 
400 

Trp Arg Arg Ser 
415 

Lys Asn Phe Leu 
430 

Ala Met Phe Ser 
445 

Thr Arg lie Arg 



Tyr Ala Cys Ala 
480 

He Gly Ala Phe 
495 

Glu Leu Gly Glu 
510 

Ser Arg Gly Val 

525 

Leu Arg Leu Met 



Val Val Pro Ser 
560 

Ala Val Tyr Asp 
575 

Ser He Glu Lys 
590 
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Asp Leu lie Asp 
595 

Pro lie Val Lys 
610 

Leu Thr Arg Phe 
625 

His Ser Asp His 



Thr Leu Val Arg 
660 

Pro Leu Phe His 
675 

Arg Pro Val Leu 
690 

Leu Gin Tyr Ser 
705 

Gly Pro Arg Asn 



His Glu Ser Ser 
740 

Tyr Glu lie Cys 
755 

Leu Thr Met Val 
770 

Tyr Met Glu Ser 
786 

Val Val Met Tyr 



lie lie Lys Asp 
820 

Leu Phe Ser Val 
835 

Lys lie Thr Lys 
850 

Leu Arg Tyr Arg 
865 

Arg Phe Asp Lys 



Tyr Leu Asp Ser 
900 



Phe Lys Asn Asp 
600 

Val Pro Phe Tyr 
615 

Tyr Pro Gin Phe 
630 

Pro Ala Ala Ala 
645 

Leu Cys Gly Asn 



Leu His Ser Lys 
680 

Asp Gly Lys Asp 
695 

Asn Val Arg Leu 
710 

lie Asp lie Cys 
725 

Ala Met Met Met 



Gly Ala Met lie 
760 

Thr Pro Gly Glu 
775 

Leu Asp Cys Glu 
790 

Lys Phe Gly Ser 
805 

lie Met Thr Thr 



Glu Met Tyr Glu 
840 

Ser Glu Val Ser 
855 

Arg Ala Asn Ser 
870 

Lys Arg Arg Met 
885 

Lys Phe Val Ser 



Val Gin Ser Leu 



Met Ser Glu Ala 
620 

Glu Leu Lys Phe 
635 

Ala Ser Arg Leu 
650 

Ser Val Ser Asp 
665 

Thr Gin Arg Arg 



Ala Gin Arg Arg 
700 

Gly Asp Asp Asp 
715 

His Tyr Pro Leu 
730 

Val Gin Val Tyr 
745 

Lys Lys Lys Ser 



Phe Leu Asp Gly 
780 

He Glu Val Asp 
795 

Ser Cys Tyr Ser 
810 

Pro Tyr Leu Thr 
825 

Val Arg Met Gly 



Pro Ser He Ser 
860 

Asp Val Val Lys 
875 

Cys Leu Pro Gly 
890 

Arg Val Phe Asp 
905 



Lys Lys Asp Arg 
605 

Thr Gin Asn Ser 



Ser His Ser Ser 
640 

Leu Glu Asn Glu 
655 

He Gly Gly Cys 
670 

Val His Val Cys 
685 

Val Val Arg Asp 



Lys He Leu Glu 
720 

Gly Ala Cys Asp 
735 

Asp Ala Ser Leu 
750 

Arg He Thr Tyr 
765 

Arg Glu Cys Val 



Val His Ala Asp 
800 

His Lys Leu Ser 
815 

Leu Gly Gly Phe 
830 

Val Asn Tyr Phe 
845 

Cys Thr Lys Leu 



Val Lys Leu Pro 
880 

Tyr Asp Thr He 
895 

Tyr Val Val Cys 
910 
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Asn Cys Ser Ala Val Asn Ser Lys Thr Phe Glu Trp Val Trp Ser Phe 
915 920 925 

lie Lys Ser Ser Lys Ser Arg Val He He Ser Gly Lys He He His 
930 935 940 

Lys Asp Val Asn Leu Asp Leu Lys Tyr Val Glu Ser Phe Ala Ala Val 
945 950 955 960 

Met Leu Ala Ser Gly Val Arg Ser Arg Leu Ala Ser Glu Tyr Leu Ala 
965 970 975 

Lys Asn Leu Ser His Phe Ser Gly Asp Cys Ser Phe He Glu Ala Thr 
980 985 990 

Ser Phe Val Leu Arg Glu Lys He Arg Asn Met Thr Leu Asn Phe Asn 
995 1000 1005 

Glu Arg Leu Leu Gin Leu Val Lys Arg Val Ala Phe Ala Thr Leu Asp 
1010 1015 1020 

Val Ser Phe Leu Asp Leu Asp Ser Thr Leu Glu Ser He Thr Asp Phe 
1025 1030 1035 1040 

Ala Glu Cys Lys Val Ala He Glu Leu Asp Glu Leu Gly Cys Leu Arg 
1045 1050 1055 

Ala Glu Ala Glu Asn Glu Lys He Arg Asn Leu Ala Gly Asp Ser He 
1060 1065 1070 

Ala Ala Lys Leu Ala Ser Glu He Val Val Asp He Asp Ser Lys Pro 
1075 1080 1085 

Ser Pro Lys Gin Val Gly Asn Ser Ser Ser Glu Asn Ala Asp Lys Arg 
1090 1095 1100 

Glu Val Gin Arg Pro Gly Leu Arg Gly Gly Ser Arg Asn Gly Val Val 
1105 1110 1115 1120 

Gly Glu Phe Leu His Phe Val Val Asp Ser Ala Leu Arg Leu Phe Lys 
1125 1130 1135 

Tyr Ala Thr Asp Gin Gin Arg He Lys Ser Tyr Val Arg Phe Leu Asp 
1140 1145 1150 

Ser Ala Val Ser Phe Leu Asp Tyr Asn Tyr Asp Asn Leu Ser Phe He 
1155 1160 1165 

Leu Arg Val Leu Ser Glu Gly Tyr Ser Cys Met Phe Ala Phe Leu Ala 
1170 1175 1180 

Asn Arg Gly Asp Leu Ser Ser Arg Val Arg Ser Ala Val Cys Ala Val 
1185 1190 1195 1200 

Lys Glu Val Ala Thr Ser Cys Ala Asn Ala Ser Val Ser Lys Ala Lys 
1205 1210 1215 

Val Met He Thr Phe Ala Ala Ala Val Cys Ala Met Met Phe Asn Ser 
1220 1225 1230 
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Cys Gly Phe Ser Gly Asp Gly Arg Glu Tyr Lys Ser Tyr He His Arg 

1235 1240 1245 

Tyr Thr Gin Val Leu Phe Asp Thr He Phe Phe Glu Asp Ser Ser Tyr 

1250 1255 1260 

Leu Pro He Glu Val Leu Ser Ser Ala He Cys Gly Ala He Val Thr 

1265 1270 1275 1280 

Leu Phe Ser Ser Gly Ser Ser He Ser Leu Asn Ala Phe Leu Leu Gin 

1285 1290 1295 



He Thr Lys Gly Phe Ser Leu Glu Val Val Val Arg Asn Val Val Arg 
1300 1305 1310 

Val Thr His Gly Leu Ser Thr Thr Ala Thr Asp Gly Val He Arg Gly 
1315 1320 1325 

Val Phe Ser Gin He Val Ser His Leu Leu Val Gly Asn Thr Gly Asn 
1330 1335 1340 

Val Ala Tyr Gin Ser Ala Phe He Ala Gly Val Val Pro Leu Leu Val 
1345 1350 1355 1360 

Lys Lys Cys Val Ser Leu He Phe He Leu Arg Glu Asp Thr Tyr Ser 
1365 1370 1375 

Gly Phe He Lys His Gly He Ser Glu Phe Ser Phe Leu Ser Ser He 
1380 1385 1390 

Leu Lys Phe Leu Lys Gly Lys Leu Val Asp Glu Leu Lys Ser He He 
1395 1400 1405 

Gin Gly Val Phe Asp Ser Asn Lys His Val Phe Lys Glu Ala Thr Gin 
1410 1415 1420 

Glu Ala He Arg Thr Thr Val Met Gin Val Pro Val Ala Val Val Asp 
1425 1430 1435 1440 

Ala Leu Lys Ser Ala Ala Gly Lys He Tyr Asn Asn Phe Thr Ser Arg 
1445 1450 1455 

Arg Thr Phe Gly Lys Asp Glu Gly Ser Ser Ser Asp Gly Ala Cys Glu 
1460 1465 1470 

Glu Tyr Phe Ser Cys Asp Glu Gly Glu Gly Pro Gly Leu Lys Gly Gly 
1475 1480 1485 

Ser Ser Tyr Gly Phe Ser He Leu Ala Phe Phe Ser Arg He Met Trp 
1490 1495 1500 

Gly Ala Arg Arg Leu He Val Lys Val Lys His Glu Cys Phe Gly Lys 
1505 1510 1515 1520 

Leu Phe Glu Phe Leu Ser Leu Lys Leu His Glu Phe Arg Thr Arg Val 
1525 1530 1535 



Phe Gly Lys Asn Arg Thr Asp Val Gly Val Tyr Asp Phe Leu Pro Thr 
1540 1545 1550 
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Gly lie Val Glu Thr Leu Ser Ser lie Glu Glu Cys Asp Gin lie Glu 
1555 1560 1565 

Glu Leu Leu Gly Asp Asp Leu Lys Gly Asp Lys Asp Ala Ser Leu Thr 
1570 1575 1580 

Asp Met Asn Tyr Phe Glu Phe Ser Glu Asp Phe Leu Ala Ser lie Glu 
1585 1590 1595 1600 

Glu Pro Pro Phe Ala Gly Leu Arg Gly Gly Ser Lys Asn lie Ala lie 
1605 1610 1615 

Leu Ala lie Leu Glu Tyr Ala His Asn Leu Phe Arg lie Val Ala Ser 
1620 1625 1630 

Lys Cys Ser Lys Arg Pro Leu Phe Leu Ala, Phe Ala Glu Leu Ser Ser 
1635 1640 1645 

Ala Leu lie Glu Lys Phe Lys Glu Val Phe Pro Arg Lys Ser Gin Leu 
1650 1655 1660 

Val Ala He Val Arg Glu Tyr Thr Gin Arg Phe Leu Arg Ser Arg Met 
1665 1670 1675 1680 

Arg Ala Leu Gly Leu Asn Asn Glu Phe Val Val Lys Ser Phe Ala Asp 
1685 1690 1695 

Leu Leu Pro Ala Leu Met Lys Arg Lys Val Ser Gly Ser Phe Leu Ala 
1700 1705 1710 

Ser Val Tyr Arg Pro Leu Arg Gly Phe Ser Tyr Met Cys Val Ser Ala 
1715 1720 1725 

Glu Arg Arg Glu Lys Phe Phe Ala Leu Val Cys Leu He Gly Leu Ser 
1730 1735 1740 

Leu Pro Phe Phe Val Arg He Val Gly Ala Lys Ala Cys Glu Glu Leu 
1745 1750 1755 1760 

Val Ser Ser Ala Arg Arg Phe Tyr Glu Arg He Lys He Phe Leu Arg 
1765 1770 1775 

Gin Lys Tyr Val Ser Leu Ser Asn Phe Phe Cys His Leu Phe Ser Ser 
1780 1785 1790 

Asp Val Asp Asp Ser Ser Ala Ser Ala Gly Leu Lys Gly Gly Ala Ser 
1795 1800 1805 

Arg Met Thr Leu Phe His Leu Leu Val Arg Leu Ala Ser Ala Leu Leu 
1810 1815 1820 

Ser Leu Gly Trp Glu Gly Leu Lys Leu Leu Leu Ser His His Asn Leu 
1825 1830 1835 1840 

Leu Phe Leu Cys Phe Ala Leu Val Asp Asp Val Asn Val Leu He Lys 
1845 1850 1855 

Val Leu Gly Gly Leu Ser Phe Phe Val Gin Pro He Phe Ser Leu Phe 
1860 1865 1870 
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Ala Ala Met Leu Leu Gin Pro Asp Arg Phe Val Glu Tyr Ser Glu Lys 
1875 1880 1885 

Leu Val Thr Ala Phe Glu Phe Phe Leu Lys Cys Ser Pro Arg Ala Pro 
1890 1895 1900 

Ala Leu Leu Lys Gly Phe Phe Glu Cys Val Ala Asn Ser Thr Val Ser 
1905 1910 1915 1920 

Lys Thr Val Arg Arg Leu Leu Arg Cys Phe Val Lys Met Leu Lys Leu 
1925 1930 1935 

Arg Lys Gly Arg Gly Leu Arg Ala Asp Gly Arg Gly Leu His Arg Gin 
1940 1945 ' 1950 

Lys Ala Val Pro Val lie Pro Ser Asn Arg Val Val Thr Asp Gly Val 
1955 1960 1965 

Glu Arg Leu Ser Val Lys Met Gin Gly Val Glu Ala Leu Arg Thr Glu 
1970 1975 1980 

Leu Arg lie Leu Glu Asp Leu Asp Ser Ala Val lie Glu Lys Leu Asn 
1985 1990 1995 2000 

Arg Arg Arg Asn Arg Asp Thr Asn Asp Asp Glu Phe Thr Arg Pro Ala 
2005 2010 2015 

His Glu Gin Met Gin Glu Val Thr Thr Phe Cys Ser Lys Ala Asn Ser 
2020 2025 2030 

Ala Gly Leu Ala Leu Glu Arg Ala Val Leu Val Glu Asp Ala lie Lys 
2035 2040 2045 

Ser Glu Lys Leu Ser Lys Thr Val Asn Glu Met Val Arg Lys Gly Ser 
2050 2055 2060 

Thr Thr Ser Glu Glu Val Ala Val Ala Leu Ser Asp Asp Glu Ala Val 
2065 2070 2075 2080 

Glu Glu lie Ser Val Ala Asp Glu Arg Asp Asp Ser Pro Lys Thr Val 
2085 2090 2095 

Arg lie Ser Glu Tyr Leu Asn Arg Leu Asn Ser Ser Phe Glu Phe Pro 
2100 2105 2110 

Lys Pro lie Val Val Asp Asp Asn Lys Asp Thr Gly Gly Leu Thr Asn 
2115 2120 2125 

Ala Val Arg Glu Phe Tyr Tyr Met Gin Glu Leu Ala Leu Phe Glu lie 
2130 2135 2140 

His Ser Lys Leu Cys Thr Tyr Tyr Asp Gin Leu Arg lie Val Asn Phe 
2145 2150 2155 2160 

Asp Arg Ser Val Ala Pro Cys Ser Glu Asp Ala Gin Leu Tyr Val Arg 
2165 2170 2175 

Lys Asn Gly Ser Thr He Val Gin Gly Lys Glu Val Arg Leu His He 
2180 2185 2190 
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Lys Asp Phe His Asp His Asp Phe Leu Phe Asp Gly Lys lie Ser lie 
2195 2200 2205 

Asn Lys Arg Arg Arg Gly Gly Asn Val Leu Tyr His Asp Asn Leu Ala 
2210 2215 2220 

Phe Leu Ala Ser Asn Leu Phe Leu Ala Gly Tyr Pro Phe Ser Arg Ser 
2225 2230 2235 2240 

Phe Val Phe Thr Asn Ser Ser Val Asp He Leu Leu Tyr Glu Ala Pro 
2245 2250 2255 

Pro Gly Gly Gly Lys Thr Thr Thr Leu He Asp Ser Phe Leu Lys Val 
2260 2265 2270 

Phe Lys Lys Gly Glu Val Ser Thr Met He Leu Thr Ala Asn Lys Ser 
2275 2280 2285 

Ser Gin Val Glu He Leu Lys Lys Val Glu Lys Glu. Val Ser Asn He 
2290 2295 2300 

Glu Cys Gin Lys Arg Lys Asp Lys Arg Ser Pro Lys Lys Ser He Tyr 
2305 2310 2315 2320 

Thr He Asp Ala Tyr Leu Met His His Arg Gly Cys Asp Ala Asp Val 
2325 2330 2335 

Leu Phe He Asp Glu Cys Phe Met Val His Ala Gly Ser Val Leu Ala 
2340 2345 2350 

Cys He Glu Phe Thr Arg Cys His Lys Val Met He Phe Gly Asp Ser 
2355 2360 2365 

Arg Gin He His Tyr He Glu Arg Asn Glu Leu Asp Lys Cys Leu Tyr 
2370 2375 2380 

Gly Asp Leu Asp Arg Phe Val Asp Leu Gin Cys Arg Val Tyr Gly Asn 
2385 2390 2395 2400 

He Ser Tyr Arg Cys Pro Trp Asp Val Cys Ala Trp Leu Ser Thr Val 
2405 2410 2415 

Tyr Gly Asn Leu He Ala Thr Val Lys Gly Glu Ser Glu Gly Lys Ser 
2420 2425 2430 

Ser Met Arg He Asn Glu He Asn Ser Val Asp Asp Leu Val Pro Asp 
2435 2440 2445 

Val Gly Ser Thr Phe Leu Cys Met Leu Gin Ser Glu Lys Leu Glu He 
2450 2455 2460 

Ser Lys His Phe He Arg Lys Gly Leu Thr Lys Leu Asn Val Leu Thr 
2465 2470 2475 2480 

Val His Glu Ala Gin Gly Glu Thr Tyr Ala Arg Val Asn Leu Val Arg 
2485 2490 2495 

Leu Lys Phe Gin Glu Asp Glu Pro Phe Lys Ser He Arg His He Thr 
2500 2505 2510 
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Val Ala Leu Ser Arg His Thr Asp Ser Leu Thr Tyr Asn Val Leu Ala 
2515 2520 2525 

Ala Arg Arg Gly Asp Ala Thr Cys Asp Ala lie Gin Lys Ala Ala Glu 
2530 2535 2540 

Leu Val Asn Lys Phe Arg Val Phe Pro Thr Ser Phe Gly Gly Ser Val 
2545 2550 2555 2560 

lie Asn Leu Asn Val Lys Lys Asp Val Glu Asp Asn Ser Arg Cys Lys 
2565 2570 2575 

Ala Ser Ser Ala Pro Leu Ser Val lie Asn Asp Phe Leu Asn Glu Val 
2580 2585 2590 

Asn Pro Gly Thr Ala Val lie Asp Phe Gly Asp Leu Ser Ala Asp Phe 
2595 2600 2605 

Ser Thr Gly Pro Phe Glu Cys Gly Ala Ser Gly lie Val Val Arg Asp 
2610 2615 2620 

Asn lie Ser Ser Ser Asn lie Thr Asp His Asp Lys Gin Arg Val 
2625 2630 2635 

and has a molecular weight of about 290 to 300 kDa, preferably 294 kDa. 

Another such DNA molecule (GLRaV-2 ORFlb) includes nucleotides 
7922-9301 of SEQ, ID. No. 1 and codes for a grapevine leafroll virus RNA-dependent 
RNA polymerase (RdRP). This DNA molecule comprises the nucleotide sequence 
corresponding to SEQ. ID. No. 4 as follows: 



AGCGTAGTTC 


GGTCGCAGGC 


GATTCCGCGT 


AGAAAACCTT 


CTCTACAAGA 


AAATTTGTAT 


60 


TCGTTTGAAG 


CGCGGAATTA 


TAACTTCTCG 


ACTTGCGACC 


GTAACACATC 


TGCTTCAATG 


120 


TTCGGAGAGG 


CTATGGCGAT 


GAACTGTCTT 


CGTCGTTGCT 


TCGACCTAGA 


TGCCTTTTCG 


180 


TCCCTGCGTG 


ATGATGTGAT 


TAGTATCACA 


CGTTCAGGCA 


TCGAACAATG 


GCTGGAGAAA 


240 


CGTACTCCTA 


GTCAGATTAA 


AGCATTAATG 


AAGGATGTTG 


AATCGCCTTT 


GGAAATTGAC 


300 


GATGAAATTT 


GTCGTTTTT^ 


GTTGATGGTG 


AAGCGTGACG 


CTAAGGTGAA 


GTTAGACTCT 


360 


TCTTGTTTAA 


CTAAACACAG 


CGCCGCTCAA 


AATATCATGT 


TTCATCGCAA 


GAGCATTAAT 


420 


GCTATCTTCT 


CTCCTATCTT 


TT^TGAGGTG 


AAAAACCG7VA 


TAATGTGCTG 


TCTTAAGCCT 


480 


AACATAAAGT 


TTTTTACGGA 


GATGACTAAC 


AGGGATTTTG 


CTTCTGTTGT 


CAGCAACATG 


540 


CTTGGTGACG 


ACGATGTGTA 


CCATATAGGT 


GAAGTTGATT 


TCTCAAAGTA 


CGACAAGTCT 


600 


CAAGATGCTT 


TCGTGAAGGC 


TTTTGAAGAA 


GTAATGTATA 


AGGAACTCGG 


TGTTGATGAA 


660 


GAGTTGCTGG 


CTATCTGGAT 


GTGCGGCGAG 


CGGTTATCGA 


TAGCTAACAC 


TCTCGATGGT 


720 


CAGTTGTCCT 


TCACGATCGA 


GAATCAAAGG 


AAGTCGGGAG 


CTTCGAACAC 


TTGGATTGGT 


780 
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AACTCTCTCG TCACTTTGGG TATTTTAAGT CTTTACTACG ACGTTAGAAA TTTCGAGGCG 84 0 

TTGTACATCT CGGGCGATGA TTCTTTAATT TTTTCTCGCA GCGAGATTTC GAATTATGCC 900 

GACGACATAT GCACTGACAT GGGTTTTGAG ACAAAATTTA TGTCCCCAAG TGTCCCGTAC 960 

TTTTGTTCTA AATTTGTTGT TATGTGTGGT CATAAGACGT TTTTTGTTCC CGACCCGTAC 1020 

AAGCTTTTTG TCAAGTTGGG AGCAGTCAAA GAGGATGTTT CAATGGATTT CCTTTTCGAG 1080 

ACTTTTACCT CCTTTAAAGA CTTAACCTCC GATTTTAACG ACGAGCGCTT AATTCAAAAG 114 0 

CTCGCTGAAC TTGTGGCTTT AAAATATGAG GTTCAAACCG GCAACACCAC CTTGGCGTTA 1200 

AGTGTGATAC ATTGTTTGCG TTCGAATTTC CTCTCGTTTA GCAAGTTATA TCCTCGCGTG 12 60 

AAGGGATGGC AGGTTTTTTA CACGTCGGTT AAGAAAGCGC TTCTCAAGAG TGGGTGTTCT 1320 

CTCTTCGACA GTTTCATGAC CCCTTTTGGT CAGGCTGTCA TGGTTTGGGA TGATGAGTAG 138 0 

The RNA-dependent RNA polymerase has an amino acid sequence corresponding to 
SEQ. ID. No. 5 as follows: 



Ser Val Val Arg Ser Gin Ala lie Pro Arg Arg Lys Pro Ser Leu Gin 
15 10 15 

Glu Asn Leu Tyr Ser Phe Glu Ala Arg Asn Tyr Asn Phe Ser Thr Cys 
20 25 30 

Asp Arg Asn Thr Ser Ala Ser Met Phe Gly Glu Ala Met Ala Met Asn 
35 40 45 

Cys Leu Arg Arg Cys Phe Asp Leu Asp Ala Phe Ser Ser Leu Arg Asp 
50 55 60 

Asp Val lie Ser lie Thr Arg Ser Gly lie Glu Gin Trp Leu Glu Lys 
65 70 75 80 

Arg Thr Pro Ser Gin lie Lys Ala Leu Met Lys Asp Val Glu Ser Pro 
85 90 95 

Leu Glu lie Asp Asp Glu lie Cys Arg Phe Lys Leu Met Val Lys Arg 
100 105 110 

Asp Ala Lys Val Lys Leu Asp Ser Ser Cys Leu Thr Lys His Ser Ala 
115 120 125 

Ala Gin Asn lie Met Phe His Arg Lys Ser lie Asn Ala lie Phe Ser 
130 135 140 

Pro lie Phe Asn Glu Val Lys Asn Arg lie Met Cys Cys Leu Lys Pro 
145 150 155 160 

Asn lie Lys Phe Phe Thr Glu Met Thr Asn Arg Asp Phe Ala Ser Val 
165 170 175 

Val Ser Asn Met Leu Gly Asp Asp Asp Val Tyr His He Gly Glu Val 
180 185 190 
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Asp Phe Ser Lys Tyr Asp Lys Ser Gin Asp Ala Phe Val Lys Ala Phe 
195 200 205 

Glu Glu Val Met Tyr Lys Glu Leu Gly Val Asp Glu Glu Leu Leu Ala 
210 215 220 

lie Trp Met Cys Gly Glu Arg Leu Ser lie Ala Asn Thr Leu Asp Gly 
225 230 235 240 

Gin Leu Ser Phe Thr lie Glu Asn Gin Arg Lys Ser Gly Ala Ser Asn 
245 250 255 

Thr Trp lie Gly Asn Ser Leu Val Thr Leu Gly He Leu Ser Leu Tyr 
260 265 270 

Tyr Asp Val Arg Asn Phe Glu Ala Leu Tyr He Ser Gly Asp Asp Ser 
275 280 285 

Leu He Phe Ser Arg Ser Glu He Ser Asn Tyr Ala Asp Asp He Cys 
290 295 300 

Thr Asp Met Gly Phe Glu Thr Lys Phe Met Ser Pro Ser Val Pro Tyr 
305 310 315 320 

Phe Cys Ser Lys Phe Val Val Met Cys Gly His Lys Thr Phe Phe Val 
325 330 335 

Pro Asp Pro Tyr Lys Leu Phe Val Lys Leu Gly Ala Val Lys Glu Asp 
340 345 350 

Val Ser Met Asp Phe Leu Phe Glu Thr Phe Thr Ser Phe Lys Asp Leu 
355 360 365 

Thr Ser Asp Phe Asn Asp Glu Arg Leu He Gin Lys Leu Ala Glu Leu 
370 375 380 

Val Ala Leu Lys Tyr Glu Val Gin Thr Gly Asn Thr Thr Leu Ala Leu 
385 390 395 400 

Ser Val He His Cys Leu Arg Ser Asn Phe Leu Ser Phe Ser Lys Leu 
405 410 415 

Tyr Pro Arg Val Lys Gly Trp Gin Val Phe Tyr Thr Ser Val Lys Lys 
420 425 430 

Ala Leu Leu Lys Ser Gly Cys Ser Leu Phe Asp Ser Phe Met Thr Pro 
435 440 445 

Phe Gly Gin Ala Val Met Val Trp Asp Asp Glu 
450 455 

and a molecular weight from about 50 to about 54 kDa, preferably about 52 kDa. 

Another such DNA molecule (GLRAV-2 0RF2) includes nucleotides 
9365-9535 of SEQ. ID. No. 1 and codes for a small, grapevine leafroll virus hydrophobic 
protein or polypeptide. This DNA molecule comprises the nucleotide sequence 
corresponding to SEQ. ID. No. 6 as follows: 
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ATGAATCAGG TTTTGCAGTT TGAATGTTTG TTTCTGCTGA ATCTCGCGGT TTTTGCTGTG 60 
ACTTTCATTT TCATTCTTCT GGTCTTCCGC GTGATTAAGT CTTTTCGCCA GAAGGGTCAC 120 
GAAGCACCTG TTCCCGTTGT TCGTGGCGGG GGTTTTTCAA CCGTAGTGTA G 171 

The small hydrophobic protein or polypeptide has an amino acid sequence corresponding 
to SEQ. ID. No. 7 as follows: 

Met Asn Gin Val Leu Gin Phe Glu Cys Leu Phe Leu Leu Asn Leu Ala 
15 10 15 

Val Phe Ala Val Thr Phe lie Phe lie Leu Leu Val Phe Arg Val lie 
20 25 30 

Lys Ser Phe Arg Gin Lys Gly His Glu Ala Pro Val Pro Val Val Arg 
35 40 45 

Gly Gly Gly Phe Ser Thr Val Val 
50 55 

and a molecular weight from about 5 to about 7 kDa, preferably about 6 kDa. 

Another such DNA molecule (GLRaV-2 ORF3) includes nucleotides 
9551-1 1350 of SEQ. ID. No. 1 and encodes for a grapevine leafroll virus heat shock 70 
protein. This DNA molecule comprises the nucleotide sequence corresponding to SEQ. 
ID. No. 8 as follows: 



ATGGTAGTTT 


TCGGTTTGGA 


CTTTGGCACC 


ACATTCTCTA 


CGGTGTGTGT 


GTACAAGGAT 


60 


GGACGAGTTT 


TTTCATTCAA 


GCAGAATAAT 


TCGGCGTACA 


TCCCCACTTA 


CCTCTATCTC 


120 


TTCTCCGATT 


CTAACCACAT 


GACTTTTGGT 


TACGAGGCCG 


T^TCACTGAT 


GAGTAATCTG 


180 


AAAGTTAAAG 


GTTCGTTTTA 


TAGAGATTTA 


AAACGTTGGG 


TGGGTTGCGA 


TTCGAGT/^C 


240 


CTCGACGCGT 


ACCTTGACCG 


TTTAAAACCT 


CATTACTCGG 


TCCGCTTGGT 


TAAGATCGGC 


300 


TCTGGCTTGA 


ACGAAACTGT 


TTCAATTGGA 


AACTTCGGGG 


GCACTGTTAA 


GTCTGAGGCT 


360 


CATCTGCCAG 


GGTTGATAGC 


TCTCTTTATT 


AAGGCTGTCA 


TTAGTTGCGC 


GGAGGGCGCG 


420 


TTTGCGTGCA 


CTTGCACCGG 


GGTTATTTGT 


TCAGTACCTG 


CCAATTATGA 


TAGCGTTCAA 


480 


AGGAATTTCA 


CTGATCAGTG 


TGTTTCACTC 


AGCGGTTATC 


AGTGCGTATA 


TATGATCAAT 


540 


GAACCTTCAG 


CGGCTGCGCT 


ATCTGCGTGT 


AATTCGATTG 


GAAAGAAGTC 


CGCAAATTTG 


600 


GCTGTTTACG 


ATTTCGGTGG 


TGGGACCTTC 


GACGTGTCTA 


TCATTTCATA 


CCGCAACAAT 


660 


ACTTTTGTTG 


TGCGAGCTTC 


TGGAGGCGAT 


CTAAATCTCG 


GTGGAAGGGA 


TGTTGATCGT 


720 


GCGTTTCTCA 


CGCACCTCTT 


CTCTTTAACA 


TCGCTGGAAC 


CTGACCTCAC 


TTTGGATATC 


780 



R 170988. 1 
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TCGAATCTGA AAGAATCTTT ATCAAAAACG GACGCAGAGA TAGTTTACAC TTTGAGAGGT 84 0 

GTCGATGGAA GAAAAGAAGA CGTTAGAGTA AACAAAAACA TTCTTACGTC GGTGATGCTC 900 

CCCTACGTGA ACAGT^CGCT TAAGATATTA GAGTCAACCT TAAAATCGTA TGCTAAGAGT 960 

ATGAATGAGA GTGCGCGAGT TAAGTGCGAT TTAGTGCTGA TAGGAGGATC TTCATATCTT 1020 

CCTGGCCTGG CAGACGTACT AACGAAGCAT CAGAGCGTTG ATCGTATCTT AAGAGTTTCG 1080 

GATCCTCGGG CTGCCGTGGC CGTCGGTTGC GCATTATATT CTTCATGCCT CTCAGGATCT 1140 

GGGGGGTTGC TACTGATCGA CTGTGCAGCT CACACTGTCG CTATAGCGGA CAGAAGTTGT 1200 

CATCAAATCA TTTGCGCTCC AGCGGGGGCA CCGATCCCCT TTTCAGGAAG CATGCCTTTG 12 60 

TACTTAGCCA GGGTCAACAA GAACTCGCAG CGTGAAGTCG CCGTGTTTGA AGGGGAGTAC 1320 

GTTAAGTGCC CTAAGAACAG AAAGATCTGT GGAGCAAATA TAAGATTTTT TGATATAGGA 1380 

GTGACGGGTG ATTCGTACGC ACCCGTTACC TTCTATATGG ATTTCTCCAT TTCAAGCGTA 14 4 0 

GGAGCCGTTT CATTCGTGGT GAGAGGTCCT GAGGGTAAGC AAGTGTCACT CACTGGAACT 1500 

CCAGCGTATA ACTTTTCGTC TGTGGCTCTC GGATCACGCA GTGTCCGAGA ATTGCATATT 15 60 

AGTTTAAATA ATAAAGTTTT TCTCGGTTTG CTTCTACATA GAAAGGCGGA TCGACGAATA 1620 

CTTTTCACTA AGGATGAAGC GATTCGATAC GCCGATTCAA TTGATATCGC GGATGTGCTA 1680 

AAGG7\ATATA AAAGTTACGC GGCCAGTGCC TTACCACCAG ACGAGGATGT CGAATTACTC 17 4 0 

CTGGGAAAGT CTGTTCAAAA AGTTTTACGG GGAAGCAGAC TGGAAGAAAT ACCTCTCTAG 1800 

The heat shock 70 protein is believed to function as a chaperone protein and has an amino 



acid sequence corresponding to SEQ. ID. No. 9 as follows: 



Met Val Val Phe Gly Leu Asp Phe Gly Thr Thr Phe Ser Thr Val Cys 
15 10 15 

Val Tyr Lys Asp Gly Arg Val Phe Ser Phe Lys Gin Asn Asn Ser Ala 
20 25 30 

Tyr lie Pro Thr Tyr Leu Tyr Leu Phe Ser Asp Ser Asn His Met Thr 
35 40 45 

Phe Gly Tyr Glu Ala Glu Ser Leu Met Ser Asn Leu Lys Val Lys Gly 
50 55 60 

Ser Phe Tyr Arg Asp Leu Lys Arg Trp Val Gly Cys Asp Ser Ser Asn 
65 70 75 80 

Leu Asp Ala Tyr Leu Asp Arg Leu Lys Pro His Tyr Ser Val Arg Leu 
85 90 95 

Val Lys lie Gly Ser Gly Leu Asn Glu Thr Val Ser lie Gly Asn Phe 
100 105 110 
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Gly Gly Thr Val Lys Ser Glu Ala His Leu Pro Gly Leu lie Ala Leu 
115 120 125 

Phe He Lys Ala Val He Ser Cys Ala Glu Gly Ala Phe Ala Cys Thr 
130 135 140 

Cys Thr Gly Val He Cys Ser Val Pro Ala Asn Tyr Asp Ser Val Gin 
145 150 155 160 

Arg Asn Phe Thr Asp Gin Cys Val Ser Leu Ser Gly Tyr Gin Cys Val 
165 170 175 

Tyr Met He Asn Glu Pro Ser Ala Ala Ala Leu Ser Ala Cys Asn Ser 
180 185 190 

He Gly Lys Lys Ser Ala Asn Leu Ala Val Tyr Asp Phe Gly Gly Gly 
195 200 205 

Thr Phe Asp Val Ser He He Ser Tyr Arg Asn Asn Thr Phe Val Val 
210 215 220 

Arg Ala Ser Gly Gly Asp Leu Asn Leu Gly Gly Arg Asp Val Asp Arg 
225 230 235 240 

Ala Phe Leu Thr His Leu Phe Ser Leu Thr Ser Leu Glu Pro Asp Leu 
245 250 255 

Thr Leu Asp He Ser Asn Leu Lys Glu Ser Leu Ser Lys Thr Asp Ala 
260 265 270 

Glu He Val Tyr Thr Leu Arg Gly Val Asp Gly Arg Lys Glu Asp Val 
275 280 285 

Arg Val Asn Lys Asn He Leu Thr Ser Val Met Leu Pro Tyr Val Asn 
290 295 300 

Arg Thr Leu Lys He Leu Glu Ser Thr Leu Lys Ser Tyr Ala Lys Ser 
305 310 315 320 

Met Asn Glu Ser Ala Arg Val Lys Cys Asp Leu Val Leu He Gly Gly 
325 330 335 

Ser Ser Tyr Leu Pro Gly Leu Ala Asp Val Leu Thr Lys His Gin Ser 
340 345 350 

Val Asp Arg He Leu Arg Val Ser Asp Pro Arg Ala Ala Val Ala Val 
355 360 365 

Gly Cys Ala Leu Tyr Ser Ser Cys Leu Ser Gly Ser Gly Gly Leu Leu 
370 375 380 

Leu He Asp Cys Ala Ala His Thr Val Ala He Ala Asp Arg Ser Cys 
385 390 395 400 

His Gin He He Cys Ala Pro Ala Gly Ala Pro He Pro Phe Ser Gly 
405 410 415 

Ser Met Pro Leu Tyr Leu Ala Arg Val Asn Lys Asn Ser Gin Arg Glu 
420 425 430 
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Val Ala Val Phe Glu Gly Glu Tyr Val Lys Cys Pro Lys Asn Arg Lys 
435 .440 445 

lie Cys Gly Ala Asn lie Arg Phe Phe Asp He Gly Val Thr Gly Asp 
450 455 460 

Ser Tyr Ala Pro Val Thr Phe Tyr Met Asp Phe Ser He Ser Ser Val 
465 470 475 480 

Gly Ala Val Ser Phe Val Val Arg Gly Pro Glu Gly Lys Gin Val Ser 
485 490 495 

Leu Thr Gly Thr Pro Ala Tyr Asn Phe Ser Ser Val Ala Leu Gly Ser 
500 505 510 

Arg Ser Val Arg Glu Leu His He Ser Leu Asn Asn Lys Val Phe Leu 
515 520 525 

Gly Leu Leu Leu His Arg Lys Ala Asp Arg Arg He Leu Phe Thr Lys 
530 535 540 

Asp Glu Ala He Arg Tyr Ala Asp Ser He Asp He Ala Asp Val Leu 
545 550 555 560 

Lys Glu Tyr Lys Ser Tyr Ala Ala Ser Ala Leu Pro Pro Asp Glu Asp 
565 570 575 

Val Glu Leu Leu Leu Gly Lys Ser Val Gin Lys Val Leu Arg Gly Ser 
580 585 590 

Arg Leu Glu Glu He Pro Leu 
595 

and a molecular weight from about 63 to about 67 kDa, preferably about 65 IcDa. 

Another such DNA molecule (GLRaV-2 ORF4) includes nucleotides 
1 1277-12932 of SEQ. ID. No. 1 and codes for a putative grapevine leafroll virus heat 
shock 90 protein. This DNA molecule comprises a nucleotide sequence corresponding to 
SEQ. ID. No. 10 as follows: 

ATGTCGAATT ACTCCTGGGA AAGTCTGTTC AAA7VAGTTTT ACGGGG7UVGC AGACTGGAAG 60 
AAATACCTCT CTAGGAGCAT AGCAGCACAC TCAAGTGAAA TTAAAACTCT ACCAGACATT 120 
CGATTGTACG GCGGTAGGGT TGTAAAGAAG TCCGAATTCG AATCAGCACT TCCTAATTCT 180 
TTTGAACAGG AATTAGGACT GTTCATACTG AGCGTVACGGG AAGTGGGATG GAGCAAATTA 24 0 

TGCGGAATAA CGGTGGAAGA AGCAGCATAC GATCTTACGA ATCCCAAGGC TTATAAATTC 300 
ACTGCCGAGA CATGTAGCCC GGATGTAATU^ GGTGTVAGGAC AAAAATACTC TATGGAAGAC 360 
GTGATGAATT TCATGCGTTT ATCAAATCTG GATGTTAACG ACAAGATGCT GACGGAACAG 4 20 

TGTTGGTCGC TGTCCAATTC ATGCGGTGAA TTGATCAACC CAGACGACAA AGGGCGATTC 4 80 

GTGGCTCTCA CCTTTAAGGA CAGAGACACA GCTGATGACA CGGGTGCCGC CAACGTGGAA 54 0 
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TGTCGCGTGG GCGACTATCT AGTTTACGCT ATGTCCCTGT TTGAGCAGAG GACCC7VAAAA 600 

TCGCAGTCTG GCAACATCTC TCTGTACGAA TVAGTACTGTG AATACATCAG GACCTACTTA 660 

GGGAGTACAG ACCTGTTCTT CACAGCGCCG GACAGGATTC CGTTACTTAC GGGCATCCTA 7 20 

TACGATTTTT GTAAGGAATA C7\ACGTTTTC TACTCGTCAT ATAAGAGAAA CGTCGAT7VAT 7 80 

TTCAGATTCT TCTTGGCGAA TTATATGCCT TTGATATCTG ACGTCTTTGT CTTCCAGTGG 8 40 

GTAAAACCCG CGCCGGATGT TCGGCTGCTT TTTGAGTTAA GTGCAGCGGA ACTAACGCTG 900 

GAGGTTCCCA CACTGAGTTT GATAGATTCT CAAGTTGTGG TAGGTCATAT CTTTU^GATAC 9 60 

GTAGAATCCT ACACATCAGA TCCAGCCATC GACGCGTTAG AAGACAAACT GGT^GCGATA 102 0 

CTGAAAAGTA GCAATCCCCG TCTATCGACA GCGCAACTAT GGGTTGGTTT CTTTTGTTAC 1080 

TATGGTGAGT TTCGTACGGC TCAAAGTAGA GTAGTGCAAA GACCAGGCGT ATACAAAACA 114 0 

CCTGACTCAG TGGGTGGATT TGAAATAAAC ATGAAAGATG TTGAGAAATT CTTCGATAAA 12 00 

CTTCAGAGAG AATTGCCTAA TGTATCTTTG CGGCGTCAGT TTAACGGAGC TAGAGCGCAT 12 60 

GAGGCTTTCA AAATATTTAA AAACGGAAAT ATAAGTTTCA GACCTATATC GCGTTTAAAC 1320 

GTGCCTAGAG AGTTCTGGTA TCTGAACATA GACTACTTCA GGCACGCGAA TAGGTCCGGG 1380 

TT7VACCGAAG AAGAAATACT CATCCTAAAC AACATAAGCG TTGATGTTAG GAAGTTATGC 14 4 0 

GCTGAGAGAG CGTGCAATAC CCTACCTAGC GCGAAGCGCT TTAGTAAAAA TCATAAGAGT 1500 

AATATACAAT CATCACGCCA AGAGCGGAGG ATTAAAGACC CATTGGTAGT CCTGAAAGAC 15 60 

ACTTTATATG AGTTCCAACA CAAGCGTGCC GGTTGGGGGT CTCGAAGCAC TCGAGACCTC 1620 

GGGAGTCGTG CTGACCACGC GTW^GGAAGC GGTTGA 165 6 

The heat shock 90 protein has an amino acid sequence corresponding to SEQ. ID. No. 1 1 
as follows: 

Met Ser Asn Tyr Ser Trp Glu Ser Leu Phe Lys Lys Phe Tyr Gly Glu 
15 10 15 

Ala Asp Trp Lys Lys Tyr Leu Ser Arg Ser lie Ala Ala His Ser Ser 
20 25 30 

Glu lie Lys Thr Leu Pro Asp lie Arg Leu Tyr Gly Gly Arg Val Val 
35 40 45 

Lys Lys Ser Glu Phe Glu Ser Ala Leu Pro Asn Ser Phe Glu Gin Glu 
50 55 60 

Leu Gly Leu Phe lie Leu Ser Glu Arg Glu Val Gly Trp Ser Lys Leu 
65 70 75 80 
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Cvs Glv He Thr Val Glu Glu Ala Ala Tyr Asp Leu Thr Asn Pro Lys 
85 90 95 

Ala Tyr Lys Phe Thr Ala Glu Thr Cys Ser Pro Asp Val Lys Gly Glu 
100 105 110 

Gly Gin Lys Tyr Ser Met Glu Asp Val Met Asn Phe Met Arg Leu Ser 
115 120 125 

Asn Leu Asp Val Asn Asp Lys Met Leu Thr Glu Gin Cys Trp Ser Leu 
130 135 140 

Ser Asn Ser Cys Gly Glu Leu He Asn Pro Asp Asp Lys Gly Arg Phe 
145 150 155 160 

Val Ala Leu Thr Phe Lys Asp Arg Asp Thr Ala Asp Asp Thr Gly Ala 
165 1^0 175 

Ala Asn Val Glu Cys Arg Val Gly Asp Tyr Leu Val Tyr Ala Met Ser 
180 185 190 

Leu Phe Glu Gin Arg Thr Gin Lys Ser Gin Ser Gly Asn He Ser Leu 
195 200 205 

Tyr Glu Lys Tyr Cys Glu Tyr He Arg Thr Tyr Leu Gly Ser Thr Asp 
210 215 220 

Leu Phe Phe Thr Ala Pro Asp Arg He Pro Leu Leu Thr Gly He Leu 
225 230 235 240 

Tyr Asp Phe Cys Lys Glu Tyr Asn Val Phe Tyr Ser Ser Tyr Lys Arg 
245 250 255 

Asn Val Asp Asn Phe Arg Phe Phe Leu Ala Asn Tyr Met Pro Leu He 
260 265 270 

Ser Asp Val Phe Val Phe Gin Trp Val Lys Pro Ala Pro Asp Val Arg 
275 280 285 

Leu Leu Phe Glu Leu Ser Ala Ala Glu Leu Thr Leu Glu Val Pro Thr 
290 295 300 

Leu Ser Leu He Asp Ser Gin Val Val Val Gly His He Leu Arg Tyr 
305 310 315 320 

Val Glu Ser Tyr Thr Ser Asp Pro Ala He Asp Ala Leu Glu Asp Lys 
325 330 335 

Leu Glu Ala He Leu Lys Ser Ser Asn Pro Arg Leu Ser Thr Ala Gin 
340 345 350 

Leu Trp Val Gly Phe Phe Cys Tyr Tyr Gly Glu Phe Arg Thr Ala Gin 
355 360 365 

Ser Arg Val Val Gin Arg Pro Gly Val Tyr Lys Thr Pro Asp Ser Val 
370 375 380 

Gly Gly Phe Glu He Asn Met Lys Asp Val Glu Lys Phe Phe Asp Lys 
385 390 395 400 
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Leu Gin Arg Glu Leu Pro Asn Val Ser Leu Arg Arg Gin Phe Asn Gly 
405 410 415 

Ala Arg Ala His Glu Ala Phe Lys He Phe Lys Asn Gly Asn He Ser 
420 425 430 

Phe Arg Pro He Ser Arg Leu Asn Val Pro Arg Glu Phe Trp Tyr Leu 
435 440 445 

Asn He Asp Tyr Phe Arg His Ala Asn Arg Ser Gly Leu Thr Glu Glu 
450 455 460 

Glu He Leu He Leu Asn Asn He Ser Val Asp Val Arg Lys Leu Cvs 
465 470 475 4^0 

Ala Glu Arg Ala Cys Asn Thr Leu Pro Ser Ala Lys Arg Phe Ser Lys 
485 490 495 

Asn His Lys Ser Asn He Gin Ser Ser Arg Gin Glu Arg Arg He Lvs 
500 505 510 

Asp Pro Leu Val Val Leu Lys Asp Thr Leu Tyr Glu Phe Gin His Lvs 
515 520 525 

Arg Ala Gly Trp Gly Ser Arg Ser Thr Arg Asp Leu Gly Ser Arq Ala 
530 535 540 

Asp His Ala Lys Gly Ser Gly 
545 550 

and a molecular weight from about 61 to about 65 kDa, preferably about 63 kDa. 

Yet another DNA molecule of the present invention (GLRaV-2 ORP5) 
includes nucleotides 12844-13515 of SEQ. ID. No. 1 and codes for a diverged coat 
protein. This DNA molecule comprises a nucleotide sequence corresponding to SEQ. ID. 
No. 12 as follows: 



ATGAGTTCCA ACACAAGCGT GCCGGTTGGG GGTCTCGAAG CACTCGAGAC CTCGGGAGTC 60 

GTGCTGACCA CGCGAAAGGA AGCGGTTGAT AAGTTTTTTA ATGAACTAAA AAACGAAAAT 120 

TACTCATCAG TTGACAGCAG CCGATTAAGC GATTCGGAAG TAAAAGAAGT GTTAGAGAAA 180 

AGTAAAGAAA GTTTCAAAAG CGAACTGGCC TCCACTGACG AGCACTTCGT CTACCACATT 24 0 

ATATTTTTCT TAATCCGATG TGCTAAGATA TCGACAAGTG AAAAGGTGAA GTACGTTGGT 300 

AGTCATACGT ACGTGGTCGA CGGAAAAACG TACACCGTTC TTGACGCTTG GGTATTCAAC 360 

ATGATGAAAA GTCTCACGAA GAAGTACAAA CGAGTGAATG GTCTGCGTGC GTTCTGTTGC 4 20 

GCGTGCGAAG ATCTATATCT AACCGTCGCA CCAATAATGT CAGAACGCTT TAAGACTAAA 4 80 

GCCGTAGGGA TGAAAGGTTT GCCTGTTGGA AAGGAATACT TAGGCGCCGA CTTTCTTTCG 54 0 

GGAACTAGCA AACTGATGAG CGATCACGAC AGGGCGGTCT CCATCGTTGC AGCGAAAAAC 600 
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GCTGTCGATC GTAGCGCTTT CACGGGTGGG GAGAGAAAGA TAGTTAGTTT GTATGATCTA 



660 



GGGAGGTACT AA 



672 



The diverged coat protein has an amino acid sequence corresponding to SEQ, ID. No. 13 



Met Ser Ser Asn Thr Ser Val Pro Val Gly Gly Leu Glu Ala Leu Glu 
15 10 15 

Thr Ser Gly Val Val Leu Thr Thr Arg Lys Glu Ala Val Asp Lys Phe 
20 25 30 

Phe Asn Glu Leu Lys Asn Glu Asn Tyr Ser Ser Val Asp Ser Ser Arg 
35 40 45 

Leu Ser Asp Ser Glu Val Lys Glu Val Leu Glu Lys Ser Lys Glu Ser 
50 55 60 

Phe Lys Ser Glu Leu Ala Ser Thr Asp Glu His Phe Val Tyr His lie 
65 70 75 80 

lie Phe Phe Leu lie Arg Cys Ala Lys lie Ser Thr Ser Glu Lys Val 
85 90 95 

Lys Tyr Val Gly Ser His Thr Tyr Val Val Asp Gly Lys Thr Tyr Thr 
100 105 110 

Val Leu Asp Ala Trp Val Phe Asn Met Met Lys Ser Leu Thr Lys Lys 
115 120 125 

Tyr Lys Arg Val Asn Gly Leu Arg Ala Phe Cys Cys Ala Cys Glu Asp 
130 135 140 

Leu Tyr Leu Thr Val Ala Pro lie Met Ser Glu Arg Phe Lys Thr Lys 
145 150 155 160 

Ala Val Gly Met Lys Gly Leu Pro Val Gly Lys Glu Tyr Leu Gly Ala 
165 170 175 

Asp Phe Leu Ser Gly Thr Ser Lys Leu Met Ser Asp His Asp Arg Ala 
180 185 190 

Val Ser lie Val Ala Ala Lys Asn Ala Val Asp Arg Ser Ala Phe Thr 
195 200 205 

Gly Gly Glu Arg Lys lie Val Ser Leu Tyr Asp Leu Gly Arg Tyr 
210 215 220 

and a molecular weight from about 23 to about 27 kDa, preferably about 25 kDa. 



13584-14180 of SEQ. ID. No. 1 and codes for a grapevine leafroll virus coat protein. 
This DNA molecule comprises a nucleotide sequence conesponding to SEQ. ID, No. 14 



as follows: 



Another such DNA molecule (GLRaV-2 ORF6) includes nucleotides 



as follows: 
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ATGGAGTTGA TGTCCGACAG CAACCTTAGC AACCTGGTGA TAACCGACGC CTCTAGTCTA 60 

AATGGTGTCG ACAAGAAGCT TTTATCTGCT GAAGTTGAAA AAATGTTGGT GCAGAAAGGG 120 

GCTCCTAACG AGGGTATAGA AGTGGTGTTC GGTCTACTCC TTTACGCACT CGCGGCTU^GA 180 

ACCACGTCTC CTAAGGTTCA GCGCGCAGAT TCAGACGTTA TATTTTCAAA TAGTTTCGGA 24 0 

GAGAGG7\ATG TGGTAGTAAC AGAGGGTGAC CTTAAGAAGG TACTCGACGG GTGTGCGCCT 300 

CTCACTAGGT TCACTAATAA ACTTAGAACG TTCGGTCGTA CTTTCACTGA GGCTTACGTT 360 

GACTTTTGTA TCGCGTATAA GCACAAATTA CCCCAACTCA ACGCCGCGGC GGAATTGGGG 4 20 

ATTCCAGCTG AAGATTCGTA CTTAGCTGCA GATTTTCTGG GTACTTGCCC GAAGCTCTCT 4 80 

GAATTACAGC AAAGTAGGAA GATGTTCGCG AGTATGTACG CTCTAAAAAC TGAAGGTGGA 54 0 

GTGGTAAATA CACCAGTGAG CAATCTGCGT CAGCTAGGTA GAAGGGAAGT TATGT7VA 597 

The coat protein has an amino acid sequence corresponding to SEQ. ID. No. 15 as 
follows: 



Met Glu Leu Met Ser Asp Ser Asn Leu Ser Asn Leu Val lie Thr Asp 
15 10 15 

Ala Ser Ser Leu Asn Gly Val Asp Lys Lys Leu Leu Ser Ala Glu Val 
20 25 30 

Glu Lys Met Leu Val Gin Lys Gly Ala Pro Asn Glu Gly lie Glu Val 
35 40 45 

Val Phe Gly Leu Leu Leu Tyr Ala Leu Ala Ala Arg Thr Thr Ser Pro 
50 55 60 

Lys Val Gin Arg Ala Asp Ser Asp Val lie Phe Ser Asn Ser Phe Gly 
65 70 75 80 

Glu Arg Asn Val Val Val Thr Glu Gly Asp Leu Lys Lys Val Leu Asp 
85 90 95 

Gly Cys Ala Pro Leu Thr Arg Phe Thr Asn Lys Leu Arg Thr Phe Gly 
100 105 110 

Arg Thr Phe Thr Glu Ala Tyr Val Asp Phe Cys lie Ala Tyr Lys His 
115 120 125 

Lys Leu Pro Gin Leu Asn Ala Ala Ala Glu Leu Gly lie Pro Ala Glu 
130 135 140 

Asp Ser Tyr Leu Ala Ala Asp Phe Leu Gly Thr Cys Pro Lys Leu Ser 
145 150 155 160 

Glu Leu Gin Gin Ser Arg Lys Met Phe Ala Ser Met Tyr Ala Leu Lys 
165 170 175 
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Thr Glu Gly Gly Val Val Asn Thr Pro Val Ser Asn Leu Arg Gin Leu 
180 185 190 

Gly Arg Arg Glu Val Met 
195 

and a molecular weight from about 20 to about 24 kDa, preferably about 22 kDa. 

Another such DNA molecule (GLRaV-2 0RF7) includes nucleotides 
14180-14665 of SEQ, ID. No. 1 and codes for a second undefined grapevine leafroll virus 
protein or polypeptide. This DNA molecule comprises a nucleotide sequence 
corresponding to SEQ. ID. No. 16 as follows: 

ATGGAAGATT ACGAAGAAT^ ATCCGAATCG CTCATACTGC TACGCACG7VA TCTGAACACT 60 
ATGCTTTTAG TGGTCAAGTC CGATGCTAGT GTAGAGCTGC CTAAACTACT AATTTGCGGT 120 
TACTTACGAG TGTCAGGACG TGGGGAGGTG ACGTGTTGCA ACCGTGAGGA ATTAACAAGA 180 
GATTTTGAGG GCAATCATCA TACGGTGATC CGTTCTAGAA TCATACAATA TGACAGCGAG 24 0 

TCTGCTTTTG AGGAATTCAA CAACTCTGAT TGCGTAGTGA AGTTTTTCCT AGAGACTGGT 300 
AGTGTCTTTT GGTTTTTCCT TCGAAGTGAA ACCAAAGGTA GAGCGGTGCG ACATTTGCGC 3 60 

ACCTTCTTCG 7VAGCTAACAA TTTCTTCTTT GGATCGCATT GCGGTACCAT GGAGTATTGT 4 20 

TTGAAGCAGG TACTAACTGA AACTGAATCT ATAATCGATT CTTTTTGCGA AGAAAGAAAT 4 80 

CGTTAA ^86 

The second undefined grapevine leafroll virus protein or polypeptide has a deduced 
amino acid sequence corresponding to SEQ. ID. No. 17 as follows: 



Met Glu Asp Tyr Glu Glu Lys Ser Glu Ser Leu lie Leu Leu Arg Thr 
15 10 15 

Asn Leu Asn Thr Met Leu Leu Val Val Lys Ser Asp Ala Ser Val Glu 
20 25 30 

Leu Pro Lys Leu Leu lie Cys Gly Tyr Leu Arg Val Ser Gly Arg Gly 
35 40 45 

Glu Val Thr Cys Cys Asn Arg Glu Glu Leu Thr Arg Asp Phe Glu Gly 
50 55 60 

Asn His His Thr Val lie Arg Ser Arg He He Gin Tyr Asp Ser Glu 
65 70 75 80 

Ser Ala Phe Glu Glu Phe Asn Asn Ser Asp Cys Val Val Lys Phe Phe 
85 90 95 

Leu Glu Thr Gly Ser Val Phe Trp Phe Phe Leu Arg Ser Glu Thr Lys 
100 105 110 
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Gly Arg Ala Val Arg His Leu Arg Thr Phe Phe Glu Ala Asn Asn Phe 
115 120 125 

Phe Phe Gly Ser His Cys Gly Thr Met Glu Tyr Cys Leu Lys Gin Val 
130 135 140 

Leu Thr Glu Thr Glu Ser lie lie Asp Ser Phe Cys Glu Glu Arg Asn 
145 150 155 160 

Arg 

and a molecular weight from about 17 to about 21 kDa, preferably about 19 kDa. 

Yet another such DNA molecule (GLRaV-2 0RF8) includes nucleotides 
14667-1 5284 of SEQ. ID. No. 1 and codes for a third undefined grapevine leafroll virus 
protein or polypeptide. This DNA molecule comprises a nucleotide sequence 
corresponding to SEQ. ID. No. 18 as follow^s: 



ATGAGGGTTA TAGTGTCTCC TTATGAAGCT GAAGACATTC TGAAAAGATC GACTGACATG 60 

TTACGAAACA TAGACAGTGG GGTCTTGAGC ACTAAAGAAT GTATCAAGGC ATTCTCGACG 120 

ATAACGCGAG ACCTACATTG TGCGAAGGCT TCCTACCAGT GGGGTGTTGA CACTGGGTTA 180 

TATCAGCGTA ATTGCGCTGA AAAACGTTTA ATTGACACGG TGGAGTCAAA CATACGGTTG 24 0 

GCTCAACCTC TCGTGCGTGA T^AAAGTGGCG GTTCATTTTT GTAAGGATGA ACCAAAAGAG 300 

CTAGTAGCAT TCATCACGCG AAAGTACGTG GAACTCACGG GCGTGGGAGT GAGAGAAGCG 360 

GTGAAGAGGG AAATGCGCTC TCTTACCAAA ACAGTTTTAA ATAAAATGTC TTTGG7VAATG 4 20 

GCGTTTTACA TGTCACCACG AGCGTGGAAA AACGCTGAAT GGTTAGAACT AAAATTTTCA 4 80 

CCTGTGAAAA TCTTTAGAGA TCTGCTATTA GACGTGGTIAA CGCTCAACGA ATTGTGCGCC 54 0 

GAAGATGATG TTCACGTCGA CAAAGTAAAT GAGAATGGGG ACGAAAATCA CGACCTCGAA 600 

CTCCAAGACG AATGTTAA 618 

The third undefined protein or polypeptide has a deduced amino acid sequence 
corresponding to SEQ. ID. No. 1 9 as follows: 

Met Arg Val lie Val Ser Pro Tyr Glu Ala Glu Asp lie Leu Lys Arg 
15 10 15 

Ser Thr Asp Met Leu Arg Asn lie Asp Ser Gly Val Leu Ser Thr Lys 
20 25 30 



Glu Cys lie Lys Ala Phe Ser Thr lie Thr Arg Asp Leu His Cys Ala 
35 40 45 
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Lys Ala Ser Tyr Gin Trp Gly Val Asp Thr Gly Leu Tyr Gin Arg Asn 
50 55 60 

Cys Ala Glu Lys Arg Leu lie Asp Thr Val Glu Ser Asn lie Arg Leu 
65 70 75 80 

Ala Gin Pro Leu Val Arg Glu Lys Val Ala Val His Phe Cys Lys Asp 
85 90 95 

Glu Pro Lys Glu Leu Val Ala Phe lie Thr Arg Lys Tyr Val Glu Leu 
100 105 110 

Thr Gly Val Gly Val Arg Glu Ala Val Lys Arg Glu Met Arg Ser Leu 
115 120 125 

Thr Lys Thr Val Leu Asn Lys Met Ser Leu Glu Met Ala Phe Tyr Met 
130 135 140 

Ser Pro Arg Ala Trp Lys Asn Ala Glu Trp Leu Glu Leu Lys Phe Ser 
145 150 155 160 

Pro Val Lys lie Phe Arg Asp Leu Leu Leu Asp Val Glu Thr Leu Asn 
165 170 175 

Glu Leu Cys Ala Glu Asp Asp Val His Val Asp Lys Val Asn Glu Asn 
180 185 190 

Gly Asp Glu Asn His Asp Leu Glu Leu Gin Asp Glu Cys 
195 200 205 

and a molecular weight from about 22 to about 26 kDa, preferably about 24 IcDa. 

Another DNA molecule of the present invention (GLRaV-2 3' UTR) 
includes nucleotides 15285-15500 of SEQ. ID. No. 1 and comprises a nucleotide 
sequence corresponding to SEQ, ID. No. 23 as follows: 

ACATTGGTTA AGTTTAACGA AT^TGATTAG TAAATAATAA ATCGAACGTG GGTGTATCTA 60 
CCTGACGTAT CAACTTAAGC TGTTACTGAG TAATTAAACC AACAAGTGTT GGTGTAATGT 120 
GTATGTTGAT GTAGAGAAAA ATCCGTTTGT AGAACGGTGT TTTTCTCTTC TTTATTTTTA 180 
AAAAAAAAAT AAAAAAAAAA AAAAAAAAGC GGCCGC 216 

5 Also encompassed by the present invention are fragments of the DNA 

molecules of the present invention. Suitable fragments capable of imparting grapevine 
leafroll resistance to grape plants are constructed by using appropriate restriction sites, 
revealed by inspection of the DNA molecule's sequence, to: (i) insert an interposon 
(Felley et al, "Interposon Mutagenesis of Soil and Water Bacteria: a Family of DNA 
1 0 Fragments Designed for in vitro Insertion Mutagenesis of Gram-negative Bacteria," 

Gene , 52:147-15 (1987), which is hereby incorporated by reference) such that truncated 
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forms of the grapevine leafroll virus coat polypeptide or protein, that lack various 
amounts of the C-terminus, can be produced or (ii) delete various internal portions of the 
protein. Alternatively, the sequence can be used to amplify any portion of the coding 
region, such that it can be cloned into a vector supplying both transcription and 
5 translation start signals. 

Suitable DNA molecules are those that hybridize to a DN A molecule 
comprising a nucleotide sequence of at least 15 continuous bases of SEQ. ID. No. 1 under 
stringent conditions characterized by a hybridization buffer comprising 0.9M sodium 
citrate ("SSC") buffer at a temperature of 37°C and remaining bound when subject to 

1 0 washing with SSC buffer at 37°C; and preferably in a hybridization buffer comprising 

20% formamide in 0.9M saline/0.9M SSC buffer at a temperature of 42°C and remaining 
bound when subject to washing at 42°C with 0.2x SSC buffer at 42°C. 

Variants may also (or alternatively) be modified by, for example, the 
deletion or addition of nucleotides that have minimal influence on the properties, 

15 secondary structure and hydropathic nature of the encoded polypeptide. For example, the 
nucleotides encoding a polypeptide may be conjugated to a signal (or leader) sequence at 
the N-terminal end of the protein which co-translationally or post-translationally directs 
transfer of the protein. The nucleotide sequence may also be altered so that the encoded 
polypeptide is conjugated to a linker or other sequence for ease of synthesis, purification, 

20 or identification of the polypeptide. 

The protein or polypeptide of the present invention is preferably produced 
in purified form (preferably, at least about 80%, more preferably 90%, pure) by 
conventional techniques. Typically, the protein or polypeptide of the present invention is 
isolated by lysing and sonication. After washing, the lysate pellet is resuspended in 

25 buffer containing Tris-HCl. During dialysis, a precipitate forms from this protein 

solution. The solution is centrifuged, and the pellet is washed and resuspended in the 
buffer containing Tris-HCL Proteins are resolved by electrophoresis tlirough an SDS 
12% polyacrylamide gel. 

The DNA molecule encoding the grapevine leafroll virus (type 2) protein 

30 or polypeptide of the present invention can be incorporated in cells using conventional 

recombinant DNA technology. Generally, this involves inserting the DNA molecule into 
an expression system to which the DNA molecule is heterologous (i.e. not normally 
present). The heterologous DNA molecule is inserted into the expression system or 
vector in proper sense orientation and correct reading frame. The vector contains the 
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necessary elements for the transcription and translation of the inserted protein-coding 
sequences. 

U.S. Patent No. 4,237,224 to Cohen and Boyer, which is hereby 
incorporated by reference, describes the production of expression systems in the form of 
recombinant plasmids using restriction enzyme cleavage and ligation with DNA ligase. 
These recombinant plasmids are then introduced by means of transformation and 
replicated in unicellular cultures including procaryotic organisms and eucaryotic cells 
grown in tissue culture. 

Recombinant genes may also be introduced into viruses, such as vaccinia 
virus. Recombinant viruses can be generated by transfection of plasmids into cells 
infected with virus. 

Suitable vectors include, but are not limited to, the following viral vectors 
such as lambda vector system gtl 1, gt WES.tB, Charon 4, and plasipid vectors such as 
pBR322, pBR325, pACYC177, pACYCl 84, pUC8, pUC9, pUC18, pUC19, pLG339, 
pR290, pKC37, pKClOl, SV 40, pBluescript 11 SK +/- or KS +/- (see "Stratagene 
Cloning Systems" Catalog (1993) from Stratagene, La Jolla, Calif, which is hereby 
incorporated by reference), pQE, pIH821, pGEX, pET series (see Studier et. al., "Use of 
T7 RNA Polymerase to Direct Expression of Cloned Genes," Gene Expression 
Technology , vol. 185 (1990), which is hereby incorporated by reference), and any 
derivatives thereof. Recombinant molecules can be introduced into cells via 
transformation, transduction, conjugation, mobilization, or electroporation. The DNA 
sequences are cloned into the vector using standard cloning procedures in the art, as 
described by Maniatis et al.. Molecular Cloning: A T aboratorv Manual. Cold Springs 
Laboratory, Cold Springs Harbor, New York (1982), which is hereby incorporated by 
reference. 

A variety of host-vector systems may be utilized to express the protein- 
encoding sequence(s). Primarily, the vector system must be compatible with the host cell 
used. Host-vector systems include but are not limited to the following: bacteria 
transformed with bacteriophage DNA, plasmid DNA, or cosmid DNA; microorganisms 
such as yeast containing yeast vectors; mammalian cell systems infected with virus (e.g., 
vaccinia virus, adenovirus, etc.); insect cell systems infected with virus (e.g., 
baculovirus); and plant cells infected by bacteria or transformed via particle 
bombardment (i.e. biolistics). The expression elements of these vectors vary in their 
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Strength and specificities. Depending upon the host- vector system utilized, any one of a 
number of suitable transcription and translation elements can be used. 

Different genetic signals and processing events control many levels of 
gene expression (e.g., DNA transcription and messenger RNA ("niRNA") translation). 
5 Transcription of DNA is dependent upon the presence of a promoter which 

is a DNA sequence that directs the binding of RNA polymerase and thereby promotes 
mRN A synthesis. The DNA sequences of eucaryotic promoters differ from those of 
procaryotic promoters. Furthermore, eucaryotic promoters and accompanying genetic 
signals may not be recognized in or may not function in a procaryotic system, and, 

10 further, procaryotic promoters are not recognized and do not function in eucaryotic cells. 

Similarly, translation of mRNA in procaryotes depends upon the presence 
of the proper procaryotic signals which differ from those of eucaryotes. Efficient 
translation of mRNA in procaryotes requires a ribosome binding site called the Shine- 
Dalgarno ("SO") sequence on the mRNA. This sequence is a short nucleotide sequence 

1 5 of mRNA that is located before the start codon, usually AUG, which encodes the amino- 
terminal methionine of the protein. The SD sequences are complementary to the 3 '-end 
of the 16S rRNA (ribosomal RNA) and probably promote binding of mRNA to ribosomes 
by duplexing with the rRNA to allow correct positioning of the ribosome. For a review 
on maximizing gene expression, see Roberts and Lauer, Methods in Enzvmology , 68:473 

20 (1979), which is hereby incorporated by reference. 

Promoters vary in their "strength" (i.e. their ability to promote 
transcription). For the purposes of expressing a cloned gene, it is desirable to use strong 
promoters in order to obtain a high level of transcription and, hence, expression of the 
gene. Depending upon the host cell system utilized, any one of a number of suitable 

25 promoters may be used. For instance, when cloning in E. coli, its bacteriophages, or 
plasmids, promoters such as the T7 phage promoter, lac promoter, trp promoter, recA 
promoter, ribosomal RNA promoter, the Pr and Pl promoters of coliphage lambda and 
others, including but not limited, to /^3cUV5, ompF, bla, Ipp, and the like, may be used to 
direct high levels of transcription of adjacent DNA segments. Additionally, a hybrid trp- 

30 lacXiWS {tad) promoter or other E. coli promoters produced by recombinant DNA or other 
synthetic DNA techniques may be used to provide for transcription of the inserted gene. 

Bacterial host cell strains and expression vectors may be chosen which 
inhibit the action of the promoter unless specifically induced. In certain operons, the 
addition of specific inducers is necessary for efficient transcription of the inserted DNA. 
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For example, the lac operon is induced by the addition of lactose or IPTG (isopropylthio- 
beta-D-galactoside). A variety of other operons, such as irp.pro, etc., are under different 
controls. 

Specific initiation signals are also required for efficient gene transcription 
and translation in procaryotic cells. These transcription and translation initiation signals 
may vary in "strength" as measured by the quantity of gene specific messenger RNA and 
protein synthesized, respectively. The DNA expression vector, which contains a 
promoter, may also contain any combination of various "strong" transcription and/or 
translation initiation signals. For instance, efficient translation in E, coli requires a Shine- 
Dalgarno ("SD") sequence about 7-9 bases 5' to the initiation codon ("ATG") to provide 
a ribosome binding site. Thus, any SD-ATG combination that can be utilized by host cell 
ribosomes may be employed. Such combinations include but are not limited to the SD- 
ATG combination from the cro gene or the N gene of coliphage lambda, or from the E, 
coli tryptophan E, D, C, B or A genes. Additionally, any SD-ATG combination produced 
by recombinant DNA or other tecliniques involving incorporation of synthetic nucleotides 
may be used. 

Once the isolated DNA molecules encoding the various grapevine leafroll 
virus (type 2) proteins or polypeptides, as described above, have been cloned into an 
expression system, they are ready to be incorporated into a host cell. Such incorporation 
can be carried out by the various forms of transformation noted above, depending upon 
the vector/host cell system. Suitable host cells include, but are not limited to, bacteria, 
virus, yeast, mammalian cells, insect, plant, and the like. 

The present invention also relates to RNA molecules which encode the 
various grapevine leafroll virus (type 2) proteins or polypeptides described above. The 
transcripts can be synthesized using the host cells of the present invention by any of the 
conventional techniques. The mRNA can be translated either in vitro or in vivo. Cell- 
free systems typically include wheat-germ or reticulocyte extracts. In vivo translation can 
be effected, for example, by microinjection into frog oocytes. 

One aspect of the present invention involves using one or more of the 
above DNA molecules encoding the various proteins or polypeptides of a grapevine 
leafroll virus (type 2) to transform grape plants in order to impart grapevine leafroll 
resistance to the plants. The mechanism by which resistance is imparted is not known. In 
one hypothetical mechanism, the transformed plant can express a protein or polypeptide 
of grapevine leafroll virus (type 2), and, when the transformed plant is inoculated by a 
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grapevine leafroll virus, such as GLRaV-1 , GLRaV-2, GLRav-3, GLRaV-4, GLRaV-5, 
or GLRaV-6, or combinations of these, the expressed protein or polypeptide prevents 
translation of the viral DNA. 

In this aspect of the present invention the subject DNA molecule 
5 incorporated in the plant can be constitutively expressed. Alternatively, expression can 
be regulated by a promoter which is activated by the presence of grapevine leafroll virus. 
Suitable promoters for these purposes include those from genes expressed in response to 
grapevine leafroll virus infiltration. 

The isolated DNA molecules of the present invention can be utilized to 

10 impart grapevine leafroll virus resistance for a wide variety of grapevine plants. The 
DNA molecules are particularly well suited to imparting resistance to Vids scion or 
rootstock cullivars. Scion cultivars which can be protected include those commonly 
referred to as Table or Raisin Grapes, such as Alden, Almeria, Anab-E-Shahi, Autumn 
Black, Beauty Seedless, Black Corinth, Black Damascus, Black Malvoisie, Black Prince, 

15 Blackrose, Bronx Seedless, Burgrave, Calmeria, Campbell Early, Canner, Cardinal, 
Catawba, Cliristmas, Concord, Dattier, Delight, Diamond, Dizmar, Duchess, Early 
Muscat, Emerald Seedless, Emperor, Exotic, Ferdinand de Lesseps, Fiesta, Flame 
seedless, Flame Tokay, Gasconade, Gold, Himrod, Hunisa, Hussiene, Isabella, Italia, July 
Muscat, Khandahar, Katta, Kourgane, Kishmishi, Loose Perlette, Malaga, Monukka, 

20 Muscat of Alexandria, Muscat Flame, Muscat Hamburg, New York Muscat, Niabell, 
Niagara, Olivette blanche, Ontario, Pierce, Queen, Red Malaga, Ribier, Rish Baba, 
Romulus, Ruby Seedless, Schuyler, Seneca, Suavis (IP 365), Thompson seedless, and 
Thomuscat. They also include those used in wine production, such as Aleatico, Alicante 
Bouschet, Aligote, Alvarelhao, Aramon, Baco blanc (22A), Burger, Cabernet franc, 

25 Cabernet, Sauvignon, Calzin, Carignane, Charbono, Chardonnay, Chasselas dore, Chenin 
blanc, Clairette blanche. Early Burgundy, Emerald Riesling, Feher Szagos, Fernao Pires, 
Flora, French Colombard, Fresia, Furmint, Gamay, Gewurztraminer, Grand noir. Gray 
Riesling, Green Hungarian, Green Veltliner, Grenache, Grillo, Helena, Inzolia, Lagrein, 
Lambrusco de Salamino, Malbec, Malvasia bianca, Mataro, Melon, Merlot, Meunier, 

30 Mission, Montua de Pilas, Muscadelle du Bordelais, Muscat blanc, Muscat Ottonel, 
Muscat Saint-Vallier, Nebbiolo, Nebbiolo fmo, Nebbiolo Lampia, Orange Muscat, 
Palomino, Pedro Ximenes, Petit Bouschet, Petite Sirah, Peverella, Pinot noir, Pinot Saint- 
George, Primitivo di Gioa, Red Veltliner, Refosco, Rkatsiteli, Royalty, Rubired, Ruby 
Cabernet, Saint-Emilion, Saint Macaire, Salvador, Sangiovese, Sauvignon blanc. 
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Sauvignon gris, Sauvignon vert, Scarlet, Seibel 5279, Seibel 9110, Seibel 13053, 
Semillon, Servant, Shiraz, Souzao, Sultana Crimson, Sylvaner, Tannat, Teroldico, Tinta 
Madeira, Tinto cao, Touriga, Traminer, Trebbiano Toscano, Trousseau, Valdepenas, 
Viognier, Walschriesling, White Riesling, and Zinfandel. Rootstock cultivars which can 
be protected include Couderc 1202, Couderc 1613, Couderc 1616, Couderc 3309, Dog 
Ridge, Foex 33 EM, Freedom, Ganzin 1 (A x R #1), Harmony, Kober 5BB, LN33, 
Millardet & de Grasset 4 IB, Millardet & de Grasset 420A, Millardet & de Grasset 101- 
14, Oppenheim 4 (S04), Paulsen 775, Paulsen 1045, Paulsen 1 103, Richter 99, Richter 
1 10, Riparia Gloire, Ruggeri 225, Saint-George, Salt Creek, Teleki 5 A, Vitis rupestris 
Constantia, Vids California, and Vitis girdiana. 

There exists an extensive similarity in the hsp70-related sequence regions 
of GLRaV-2 and other closteroviruses, such as tristeza virus and beet yellows virus. 
Consequently, the GLRaV-2 hsp70-related gene can also be used to produce transgenic 
plants or cultivars other than grape, such as citrus or sugar beet, which are resistant to 
closteroviruses other than grapevine leafroU, such as trisleza virus and beet yellows virus. 

Suitable citrus cultivars include lemon, lime, orange, grapefruit, pineapple, 
tangerine, and the like, such as Joppa, Maltaise Ovale, Parson (Parson Brown), Pera, 
Pineapple, Queen, Shamouti, Valencia, Tenerife, Imperial Doblefma, Washington 
Sanguine, Moro, Sanguinello Moscato, Spanish Sanguinelli, Tarocco, Atwood, 
Australian, Bahia, Baiana, Cram, Dalmau, Eddy, Fisher, Frost Washington, Gillette, 
LengNavelina, Washington, Satsuma Mandarin, Dancy, Robinson, Ponkan, Duncan, 
Marsh, Pink Marsh, Ruby Red, Red Seedless, Smooth Seville, Orlando Tangelo, Eureka, 
Lisbon, Meyer Lemon, Rough Lemon, Sour Orange, Persian Lime, West Indian Lime, 
Bearss, Sweet Lime, Troyer Citrange, and Citrus Trifoliata. Each of these citrus cultivars 
is suitable for producing transgenic citrus plants resistant to tristeza virus. 

The economically important species of sugar beet is Beta vulgaris L, 
which has four important cultivar types: sugar beet, table beet, fodder beet, and Swiss 
chard. Each of these beet cultivars is suitable for producing transgenic beet plants 
resistant to beet yellows virus, as described above. 

Because GLRaV-2 has been known to infect tobacco plants (e.g., 
Nicotiana benthamiand), it is also desirable to produce transgenic tobacco plants which 
are resistant to grapevine leafroll viruses, such as GLRaV-2. 
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Plant tissue suitable for transformation include leaf tissue, root tissue, 
meristems, zygotic and somatic embryos, and anthers. It is particularly preferred to 
utilize embryos obtained from anther cultures. 

The expression system of the present invention can be used to transform 
5 virtually any plant tissue under suitable conditions. Tissue cells transformed in accordance 
with the present invention can be grown in vitro in a suitable medium to impart grapevine 
leafroll virus resistance. Transformed cells can be regenerated into whole plants such that the 
protein or polypeptide imparts resistance to grapevine leafroll virus in the intact transgenic 
plants. In either case, the plant cells transformed with the recombinant DNA expression 

10 system of the present invention are grown and caused to express that DNA molecule to 
produce one of the above-described grapevine leafroll virus proteins or polypeptides and, 
thus, to impart grapevine leafroll virus resistance. 

In producing transgenic plants, the DNA construct in a vector described above 
can be microinjected directly into plant cells by use of micropipettes to transfer mechanically 

15 the recombinant DNA. Crossway, Mol. Gen. Genetics . 202:179-85 (1985), which is hereby 
incorporated by reference. The genetic material may also be transferred into the plant cell 
using polyethylene glycol. Krens, et al., Nature . 296:72-74 (1982), which is hereby 
incorporated by reference. 

One technique of transforming plants with the DNA molecules in 

20 accordance with the present invention is by contacting the tissue of such plants with an 
inoculum of a bacteria transformed with a vector comprising a gene in accordance with 
the present invention which imparts grapevine leafroll resistance. Generally, this 
procedure involves inoculating the plant tissue with a suspension of bacteria and 
incubating the tissue for 48 to 72 hours on regeneration medium without antibiotics at 25- 

25 28°C. 

Bacteria from the genus Agrobacterium can be utilized to transform plant 
cells. Suitable species of such bacterium moXwA^ Agrobacterium tumefaciens and 
Agrobacterium rhizogenes, Agrobacterium tumefaciens (e.g., strains C58, LBA4404, or 
EHAl 05) is particularly useful due to its well-known ability to transform plants. 
30 Heterologous genetic sequences can be introduced into appropriate plant cells, 

by means of the Ti plasmid of A. tumefaciens or the Ri plasmid of A. rhizogenes. The Ti or 
Ri plasmid is transmitted to plant cells on infection by Agrobacterium and is stably integrated 
into the plant genome. J. Schell, Science . 237:1 176-83 (1987), which is hereby incorporated 
by reference. 
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After transformation, the transformed plant cells must be regenerated. 

Plant regeneration from cultured protoplasts is described in Evans et ah, 
Handbook of Plant Cell Cultures, Vol. 1 : (MacMillan Publishing Co., New York, 1983); and 
Vasil ].R. (ed.), Cell Culture and Somatic Cell Genetics of Plants . Acad. Press, Orlando, 
5 Vol. I, 1984, and Vol. Ill (1986), which are hereby incorporated by reference. 

It is known that practically all plants can be regenerated from cultured cells or 
tissues, including but not limited to, all major species of sugarcane, sugar beets, cotton, fruit 
trees, and legumes. 

Means for regeneration vary from species to species of plants, but generally a 

1 0 suspension of transformed protoplasts or a petri plate containing explants is first provided. 
Callus tissue is formed and shoots may be induced from callus and subsequently rooted. 
Alternatively, embryo formation can be induced in the callus tissue. These embryos 
germinate as natural embryos to form plants. The culture media will generally contain 
various amino acids and hormones, such as auxin and cytokinins. It is also advantageous to 

15 add glutamic acid and proline to the medium. Efficient regeneration will depend on the 
medium, on the genotype, and on the history of the culture. If these three variables are 
controlled, then regeneration is usually reproducible and repeatable. 

After the expression cassette is stably incorporated in transgenic plants, it can 
be transferred to other plants by sexual crossing. Any of a number of standard breeding 

20 tecliniques can be used, depending upon the species to be crossed. 

Once transgenic plants of this type are produced, the plants themselves can be 
cultivated in accordance with conventional procedure so that the DNA construct is present in 
the resulting plants. Alternatively, transgenic seeds are recovered from the transgenic plants. 
These seeds can then be planted in the soil and cultivated using conventional procedures to 

25 produce transgenic plants. 

Another approach to transforming plant cells with a gene which imparts 
resistance to pathogens is particle bombardment (also Icnown as biolistic transformation) 
of the host cell. This can be accomplished in one of several ways. The first involves 
propelling inert or biologically active particles at cells. This technique is disclosed in 

30 U.S. Patent Nos, 4,945,050, 5,036,006, and 5,100,792, all to Sanford et al., and in 
Emerschad et al., "Somatic Embryogenesis and Plant Development from Immature 
Zygotic Embryos of Seedless Grapes (Vitis viniferd)'' Plant Cell Reports , 14:6-12 (1995) 
("Emerschad (1995)"), which are hereby incorporated by reference. Generally, this 
procedure involves propelling inert or biologically active particles at the cells under 
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conditions effective to penetrate the outer surface of the cell and to be incorporated within 
the interior thereof. When inert particles are utilized, the vector can be introduced into 
the cell by coating the particles with the vector containing the heterologous DNA. 
Alternatively, the target cell can be surrounded by the vector so that the vector is carried 
5 into the cell by the wake of the particle. Biologically active particles (e.g., dried bacterial 
cells containing the vector and heterologous DNA) can also be propelled into plant cells. 

Once a grape plant tissue, citrus plant tissue, beet plant tissue, or tobacco 
plant tissue is transformed in accordance with the present invention, the transformed 
tissue is regenerated to form a transgenic plant. Generally, regeneration is accomplished 

10 by culturing transformed tissue on medium containing the appropriate growth regulators 
and nutrients to allow for the initiation of shoot meristems. Appropriate antibiotics are 
added to the regeneration medium to inhibit the growth of Agrobacierium and to select 
for the development of transformed cells. Following shoot initiation, shoots are allowed 
to develop tissue culture and are screened for marker gene activity. 

1 5 The DNA molecules of the present invention can be made capable of 

transcription to a messenger FINA, which, although encoding for a grapevine leafroll 
virus (type 2) protein or polypeptide, does not translate to the protein. This is l<nown as 
RNA-mediated resistance. When a Vilis scion or rootstock cultivar, or a citrus, beet, or 
tobacco cultivar, is transformed with such a DNA molecule, the DNA molecule can be 

20 transcribed under conditions effective to maintain the messenger RNA in the plant cell at 
low level density readings. Density readings of between 15 and 50 using a Hewlet 
ScanJet and Image Analysis Program are preferred. 

A portion of one or more DNA molecules of the present invention as well 
as other DNA molecules can be used in a transgenic grape plant, citrus plant, beet plant, 

25 or tobacco plant in accordance with U.S. Patent Application Serial No. 09/025,635, which 
is hereby incorporated herein by reference. 

The grapevine leafroll virus (type 2) protein or polypeptide of the present 
invention can also be used to raise antibodies or binding portions thereof or probes. The 
antibodies can be monoclonal or polyclonal. 

30 Monoclonal antibody production may be effected by techniques which are 

well-known in the art. Basically, the process involves first obtaining immune cells 
(lymphocytes) from the spleen of a mammal (e.g., mouse) which has been previously 
immunized with the antigen of interest either in vivo or in vitro. The antibody-secreting 
lymphocytes are then fused with (mouse) myeloma cells or transformed cells, which are 
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capable of replicating indefinitely in cell culture, thereby producing an immortal, 
immunoglobulin-secreting cell line. The resulting fused cells, or hybridomas, are 
cultured, and the resulting colonies screened for the production of the desired monoclonal 
antibodies. Colonies producing such antibodies are cloned, and grown either in vivo or in 
vitro to produce large quantities of antibody. A description of the theoretical basis and 
practical methodology of fusing such cells is set forth in Kohler and Milstein, Nature , 
256:495 (1975), which is hereby incorporated by reference. 

Mammalian lymphocytes are immunized by in vivo immunization of the 
animal (e.g., a mouse) with the protein or polypeptide of the present invention. Such 
immunizations are repeated as necessary at intervals of up to several weeks to obtain a 
sufficient titer of antibodies. Following the last antigen boost, the animals are sacrificed 
and spleen cells removed. 

Fusion with mammalian myeloma cells or other fusion partners capable of 
replicating indefinitely in cell culture is effected by standard and well-known techniques, 
for example, by using polyethylene glycol ("PEG") or other fusing agents. (See Milstein 
and Kohler, Fur. J. Immunol. , 6:51 1 (1976), which is hereby incorporated by reference.) 
This immortal cell line, which is preferably murine, but may also be derived from cells of 
other mammalian species, including but not limited to rats and humans, is selected to be 
deficient in enzymes necessary for the utilization of certain nutrients, to be capable of 
rapid growth, and to have good fusion capability. Many such cell lines are known to 
those skilled in the art, and others are regularly described. 

Procedures for raising polyclonal antibodies are also well known. 
Typically, such antibodies can be raised by administering the protein or polypeptide of 
the present invention subcutaneously to New Zealand white rabbits which have first been 
bled to obtain pre-immune serum. The antigens can be injected at a total volume of 100 
\x\ per site at six different sites. Each injected material will contain synthetic surfactant 
adjuvant pluronic polyols, or pulverized acrylamide gel containing the protein or 
polypeptide after SDS-polyacrylamide gel electrophoresis. The rabbits are then bled two 
weeks after the first injection and periodically boosted with the same antigen thi-ee times 
every six weeks. A sample of serum is then collected 10 days after each boost. 
Polyclonal antibodies are then recovered from the serum by affinity chromatography 
using the corresponding antigen to capture the antibody. Ultimately, the rabbits are 
euthenized with pentobarbital 150 mg/Kg IV. This and other procedures for raising 
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polyclonal antibodies are disclosed in Harlow et. al., editors, Antibodies: A Laboratory 
Manual (1988), which is hereby incorporated by reference. 

In addition to utilizing whole antibodies, binding portions of such 
antibodies can be used. Such binding portions include Fab fragments, F(ab')2 fragments, 
5 and Fv fragments. These antibody fragments can be made by conventional procedures, 
such as proteolytic fragmentation procedures, as described in Coding, Monoclonal 
Antibodies: Principles and Practice . New York:Academic Press, pp. 98-1 1 8 (1 983), 
which is hereby incorporated by reference. 

The present invention also relates to probes found either in nature or 

10 prepared synthetically by recombinant DNA procedures or other biological procedures. 
Suitable probes are molecules which bind to grapevine leafroll (type 2) viral antigens 
identified by the monoclonal antibodies of the present invention. Such probes can be, for 
example, proteins, peptides, lectins, or nucleic acid probes. 

The antibodies or binding portions thereof or probes can be administered 

15 to grapevine leafroll virus infected scion cultivars or rootstock cultivars. Alternatively, at 
least the binding portions of these antibodies can be sequenced, and the encoding DNA 
synthesized. The encoding DNA molecule can be used to transform plants together with a 
promoter which causes expression of the encoded antibody when the plant is infected by 
grapevine leafroll virus. In either case, the antibody or binding portion thereof or probe 

20 will bind to the virus and help prevent the usual leafroll response. 

Antibodies raised against the GLRaV-2 proteins or polypeptides of the 
present invention or binding portions of these antibodies can be utilized in a method for 
detection of grapevine leafroll virus in a sample of tissue, such as tissue (e.g., scion or 
rootstock) from a grape plant or tobacco plant. Antibodies or binding portions thereof 

25 suitable for use in the detection method include those raised against a helicase, a 

methyltransferase, a papain-like protease, an RNA-dependent RNA polymerase, a heat 
shock 70 protein, a heat shock 90 protein, a coat protein, a diverged coat protein, or other 
proteins or polypeptides in accordance with the present invention. Any reaction of the 
sample with the antibody is detected using an assay system which indicates the presence 

30 of grapevine leafroll virus in the sample. A variety of assay systems can be employed, 
such as enzyme-linked immunosorbent assays, radioimmunoassays, gel diffusion 
precipitin reaction assays, immunodiffusion assays, agglutination assays, fluorescent 
immunoassays, protein A immunoassays, or immunoelectrophoresis assays. 



wo 98/53055 



-59- 



PCT/US98/10313 



Alternatively, grapevine leafroll virus can be detected in such a sample using a 
nucleotide sequence of the DNA molecule, or a fragment thereof, encoding for a protein or 
polypeptide of the present invention. The nucleotide sequence is provided as a probe in a 
nucleic acid hybridization assay or a gene amplification detection procedure (e.g., using a 
polymerase chain reaction procedure). The nucleic acid probes of the present invention may 
be used in any nucleic acid hybridization assay system known in the art, including, but not 
limited to. Southern blots (Southern, E.M., "Detection of Specific Sequences Among DNA 
Fragments Separated by Gel Electrophoresis," J. Mol. Biol . 98:503-17 (1975), which is 
hereby incorporated by reference). Northern blots (Thomas, P.S., "Hybridization of 
Denatured RNA and Small DNA Fragments Transferred to Nitrocellulose," Proc. NatM Acad. 
Sci. USA , 77:5201-05 (1980), which is hereby incorporated by reference), and Colony blots 
(Grunstein, M., et al., "Colony Hybridization: A Method for the Isolation of Cloned cDN As 
that Contain a Specific Gene." Proc. Nat^l Acad. Sci. USA , 72:3961-65 (1975), which is 
hereby incorporated by reference). Alternatively, the probes can be used in a gene 
amplification detection procedure (e.g., a polymerase chain reaction). Erlich, H.A., et. al., 
"Recent Advances in the Polymerase Chain Reaction," Science 252:1643-51 (1991), which is 
hereby incorporated by reference. Any reaction with the probe is detected so that the 
presence of a grapevine leafroll virus in the sample is indicated. Such detection is facilitated 
by providing the probe of the present invention with a label. Suitable labels include a 
radioactive compound, a fluorescent compound, a chemiluminescent compound, an 
enzymatic compound, or other equivalent nucleic acid labels. 

Depending upon the desired scope of detection, it is possible to utilize probes 
having nucleotide sequences that correspond with conserved or variable regions of the ORF 
or UTR. For example, to distinguish a grapevine leafroll virus from other related viruses 
(e.g., other closteroviruses), it is desirable to use probes which contain nucleotide sequences 
that correspond to sequences more highly conserved among all grapevine leafroll viruses. 
Also, to distinguish between different grapevine leafroll viruses (i.e., GLRaV-2 from 
GLRaV-1, GLRaV-3, GLRaV-4, GLRaV-5, and GLRaV-6), it is desirable to utilize probes 
containing nucleotide sequences that correspond to sequences less highly conserved among 
the different grapevine leafroll viruses. 

Nucleic acid (DNA or RNA) probes of the present invention will hybridize to 
complementary GLRaV-2 nucleic acid under stringent conditions. Generally, stringent 
conditions are selected to be about 50°C lower than the thermal melting point (Tm) for the 
specific sequence at a defined ionic strength and pH. The Tm is the temperature (under 
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defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly 
matched probe. The Tm is dependent upon the solution conditions and the base composition 
of the probe, and may be calculated using the following equation: 
T„, = 79. 8X + (18.5 X Log[Na-f]) 
5 + (58.4°Cx%[G+C]) 

(820 / #bp in duplex) 
(0.5 X % formamide) 

Nonspecific binding may also be controlled using any one of a number of known techniques 
such as, for example, blocking the membrane with protein-containing solutions, addition of 
10 heterologous RNA, DNA, and SDS to the hybridization buffer, and treatment with RNase. 

Wash conditions are typically performed at or below stringency. Generally, suitable stringent 
conditions for nucleic acid hybridization assays or gene amplification detection procedures 
are asas set forth above. More or less stringent conditions may also be selected. 

15 EXAMPLES 

The following examples are provided to illustrate embodiments of the present 
invention but are by no means intended to limit its scope. 

20 Example 1 - Northern Hybridization 

Specificity of the selected clones was confirmed by Northern hybridization. 
Northern hybridization was performed after electrophoresis of the dsRNA of GLRaV-2 in 1% 
agarose non-denaturing condition gel. The agarose gel was denatured by soaking in 50 mM 
NaOH containing 0.4 M NaCl for 30 min, and then neutralized with 0.1 M Tris-HCl (PH7.5 ) 
25 containing 0.5 M NaCl for another 30 min. RNA was sandwich blotted overnight onto 
Genescreen"^*^ plus membrane (Dupont NEN Research Product) in 10 X SSC buffer and 
hybridized as described by the manufacturer's instructions (DuPont, NEN). 

Example 2 - Sequencing and Computer Assisted Nucleotide and 
30 Amino Acid Sequence Analysis 

DNA inserts were sequenced in pBluescript SK+ by using T3 and T7 
universal primers for the terminal region sequence and additional oligonucleotide primers 



wo 98/53055 



-61 - 



PCT/US98/10313 



designed according to the known sequence for the internal region sequence. Purification of 
plasmid DNA was performed by a modified mini alkaline-lysis/PEG precipitation procedure 
described by the manufacturer (Applied Biosystems, Inc.). Nucleotide sequencing was 
performed on both strands of cDNA by using ABI TaqDyeDeoxy Terminator Cycle 
Sequencing Kit (Applied Biosystems, Inc.). Automatic sequencing was performed on an 
ABI373 Automated Sequencer (Applied Biosynstems, Inc.) at Cornell University, Geneva, 
NY. 

The nucleotide sequences of GLRaV-2 were assembled and analyzed with the 
programs of EditSeq and SeqMan, respectively, of DNASTAR package (Madison, WI). 
Amino acid sequences deduced from nucleotide sequences and its encoding open reading 
frames were conducted using the MapDraw program. Multiple alignments of amino acid 
sequences, identification of consensus amino acid sequences, and generation of phylogenetic 
trees were performed using the Clustal method in the MegAlign program. The nucleotide 
and amino acid sequences of other closteroviruses were obtained with the Entrez Program; 
and sequence comparisons with nonredundant databases were searched with the Blast 
Program from the National Center for Biotechnology Information. 



Example 3 - Isolation of dsRNA 

Several vines of GLRaV-2 infected Vitis vinifera cv Pinot Noir that originated 
from a central New York vineyard served as the source for dsRNA isolation and cDNA 
cloning. dsRNA was extracted from phloem tissue of infected grapevines according to the 
method described by Hu et al., "Characterization of Closterovirus-Like Particles Associated 
with Grapevine Leafroll Disease," J. Phytopathology 128:1-14 (1990), which is hereby 
incorporated by reference. Purification of the high molecular weight dsRNA (ca 15 kb) was 
carried out by electrophoretic separation of the total dsRNA on a 0.7% low melting point 
agarose gel and extraction by phenol/chloroform following the method described by 
Sambrook et al.. Molecular Cloning: A l aboratory Manual . 2nd ed.. Cold Sping Harbor 
Laboratory Press, New York (1989) , which is hereby incorporated by reference. 
Concentration of dsRNA was estimated with UV fluorescent density of an ethidium bromide 
stained dsRNA band in comparison with a known concentration of DNA marker. 
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Example 4 - cDNA Synthesis and Cloning 

cDNA synthesis was performed following the method initially described by 
Jelkmann et al., "Cloning of Four Plant Viruses From Small Quantities of Double-Stranded 
RNA," Phytopathology 79:1250-53 (1989) and modified by Ling et al., "The Coat Protein 
Gene of Grapevine Leafroll Associated Closterovirus-3 : Cloning, Nucleotide Sequencing and 
Expression in Transgenic Plants," Arch. Virology 142:1 101-16 (1997), both of which are 
hereby incorporated by reference. About 100 ng of high molecular weight dsRNA purified 
from low melting agarose gel was denatured in 20 mM methylmercuric hydroxide and 
incubated at room temperature for 10 min with 350 ng of random primers. First strand cDNA 
was synthesized by using avian myeloblastosis virus (AMV) reverse transcriptase. Second 
strand cDNA was obtained by using RNase H and E.coU DNA polymerase I. Double- 
stranded cDNA was blunt ended with T4 DNA polymerase and ligated with EcoR 1 adapters. 
The cDNA, which had EcoR 1 adapters at the ends, was activated by kinase reaction and 
ligated into Lambda ZAP II/EcoR I prepared arms following the manufacturer's instruction 
(Stratagene). The recombinant DNA was then packaged in vitro to Gigapack® II packaging 
extract (Stratagene). The packaged phage particles were amplified and titered according to 
the manufacturer's instruction. 

Two kinds of probes were used to identify GLRaV-2 specific clones from the 
library. One type was prepared from the synthesized cDNA that was amplified by PCR after 
ligation to the specific EcoR I Uni-Amp™ adapters (Clontech); and the other type was DNA 
inserts or PCR products fi-om already sequenced clones. Clones from the cDNA library were 
selected by colony-lifting hybridization onto the colony/plaque Screen membrane (NEN 
Research Product) with the probe described above. The probe was prepared by labeling with 
^^P [a-dATP] using Klenow fragment of E.coli DNA polymerase I. Prehybridization, 
hybridization, and washing steps were carried out at 65°C according to the manufacturer's 
instruction (Dupont, NEN Research Product). Selected plaques were converted to 
recombinant pBluescript by in vivo excision method according to the manufacturer's 
instruction (Stratagene). 

To obtain clones representing the extreme 3 '-terminus of GLRaV-2, dsRNA 
was polyadenylated by yeast poly(A) polymerase. Using poly(A)-tailed dsRNA as template, 
cDNA was amplified by RT-PCR with oligo(dT)] 8 and a specific primer, CP-1/T7R, which 
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is derived from the clone CP-1 and has a nucleotide sequence according to SEQ. ID. No. 20 
as follows: 

TGCTGGAGCT TGAGGTTCTG C 21 

The resulting PGR product (3'-PCR) was cloned into a TA vector (Invitrogen) and 
sequenced. 

As shown in Figure 1 A, a high molecular weight dsRNA of ca. 15 kb was 
consistently identified from GLRaV-2 infected grapevines, but not from healthy vines. In 
addition, several low molecular weight dsRNAs were also detected from infected tissue. 
The yield of dsRNA of GLRaV-2 was estimated between 5-10 ng/15 g phloem tissue, which 
was much lower than that of GLRaV-3 (Hu et al., "Characterization of Closterovirus-Like 
Particles Associated with Grapevine Leafroll Disease," J. Phytopathology 128:1-14 (1990), 
which is hereby incorporated by reference). Only the high molecular weight dsRNA that was 
purified from low melting point agarose gel was used for cDNA synthesis, cloning and 
establishment of the Lambda/ZAP II cDNA library. 

Two kinds of probes were used for screening the cDNA library. The initial 
clones were identified by hybridization with Uni-Amp^" PCR-amplified cDNA as probes. 
The specificity of these clones (e.g., TC-1) ranging from 200 to 1,800 bp in size was 
confirmed by Northern hybridization to dsRNA of GLRaV-2 as shown in Figure IB. 
Additionally, over 40 different clones ranging form 800 to 7,500 bp in size were identified 
following hybridization with the probes generated from GLRaV-2 specific cDNA clones or 
from PGR products. Over 40 clones were then sequenced on the both strands (Figure 2). 

Example 5 - Expression of the Coat Protein in E. coli and Immunoblotting 

To determine that 0RF6 was the coat protein gene of GLRaV-2, the complete 
ORF6 DNA molecule was subcloned from a PGR product and inserted into the fusion protein 
expression vector pMAL-C2 (New England Biolabs, Inc.). The specific primers used for the 
PGR reaction were GP-96F and GP-96R, in which an EcoR I or BaniH I site was included to 
facilitate cloning. GP-96F was designed to include the start codon of the GP and comprises a 
nucleotide sequence according to SEQ. ID. NO. 21 as follows: 



CGGAATTCAC CATGGAGTTG ATGTCCGACA G 



31 
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CP-96R was 66 nucleotides downstream of the stop codon of the CP and comprises the 
nucleotide sequence corresponding to SEQ. ID. No. 22 as follows: 

AGCGGATCCA TGGCAGATTC GTGCGTAGCA GTA 

5 

The coat protein was expressed as a fusion protein with maltose binding protein (MBP) of £*. 
coli under the control of a "tac" promoter and suppressed by the "lac'' repressor. The MBP- 
CP fusion protein was induced by adding 0.3 mM isopropyl-p-D-thio-gloactopyranoside 
(IPTG) and purified by a one step affinity column according to the manufacturer's instruction 

1 0 (New England, Biolabs, Inc ). The MBP-CP fusion protein or the coat protein cleaved from 
the fusion protein was tested to react with specific antiserum of GLRaV-2 (kindly provided 
by Dr. Charles Greif of INRA, Colmar, France) on Western blot according to the method 
described by Hu et al., "Characterization of Closterovirus-Like Particles Associated with 
Grapevine Leafroll Disease," J. Phytopatholo g y 128:1-14 (1990), which is hereby 

1 5 incorporated by reference. In contrast, the non-recombinant plasmids or uninduced cells did 
not react to the antiserum of GLRaV-2. 

E:xamDle 6 - Sequence Analysis and Genome Organization of GLRaV-2 

20 A total of 1 5,500 bp of the RNA genome of GLRaV-2 was sequenced and 

deposited in GenBank (accession number AF039204). About 85% of the total RNA genome 
was revealed from at least two different clones. The sequence in the coat protein gene region 
was determined and confirmed from several different overlapping clones. The genome 
organization of GLRaV-2, shown in Figure 2, includes nine open reading frames (e.g., 

25 ORFla, lb-8). 

ORFla and ORFlb: Analysis of the amino acid sequence of the N-terminal 
portion of GLRaV-2 ORFla encoded product revealed two putative papain-like protease 
domains, which showed significant similarity to the papain-like leader protease of BYV 
(Agranovsky et ah, "Beet Yellows Closterovirus: Complete Genome Structure and 

30 Identification of a Papain-like Thiol Protease," Virology 198:31 1-24 (1994), which is hereby 
incorporated by reference). Thus, it allowed prediction of the catalytic cysteine and histidine 
residues for the putative GLRaV-2 protease. Upon alignment of the sequence of the papain- 
like protease of BYV with that of GLRaV-2, the cleavage site at residues Gly-Gly (amino 
acid 588-589) of BYV aligned with the corresponding alanine-glycine (Ala-Gly) and Gly-Gly 

35 dipeptide of GLRaV-2 (Figure 3 A). Cleavage at this site would resuh in a leader protein and 
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a 234 IcDa (2090 amino acid) C-terminal fragment consisting of MT and HEL domains. 
However, the region upstream of the papain-like protease domain in GLRaV-2 did not show 
similarity to the corresponding region of BYV. In addition, variability in the residues located 
at the scissible bond (Gly in the BYV and Ala in the GLRaV-2) was present. Similar 
variability of the cleavage site residue in the P-PRO domain has been described in LChV 
(Jellanami et al., "Complete Genome Structure and Phylogenetic Analysis of Little Cherry 
Virus, a Mealybug-Transmissible Closterovirus. J. General Virology 78:2067-71 (1997), 
which is hereby incorporated by reference). 

Database searching with the deduced amino acid sequence of the ORFla/lb 
encoded protein revealed a significant similarity to the MT, HEL and RdRP domains of the 
other closteroviruses. The region downstream of the P-PRO cleavage site showed a 
significant similarity (57.4% identity in a 266-residues alignment) to the putative 
methyltransferase domain of BYV and contained all the conserved motifs typical of positive- 
strand RNA viral type I MTs (Figure 3B). The C-terminal portion of the ORFla was 
identified as a helicase domain, the sequence of which showed a high similarity (57.1% 
identity in a 31 5-residues alignment) to the helicase domain of BYV and contained the seven 
conserved motifs characteristic of the Superfamily I helicase of positive-strand RNA viruses 
(Figure 3C) (Hodgman, "A New Superfamily of RepHcative Proteins," Nature 333:22-23 
(1988); Koonin and Dolja, "Evolution and Taxonomy of Positive-strand RNA Viruses: 
Implications of Comparative Analysis of Amino Acid Sequences," Crit. Rev, in Biochem. 
and Mol. Biol. 28:375-430 (1993), both of which are hereby incorporated by reference). 

ORFlb encoded a 460 amino acid polypeptide with a molecular mass of 
52,486 Da, counting from the frameshifting site. Database searching with the RdRP showed 
a significant similarity to the RdRP domains of positive strand RNA viruses. Comparison of 
the RdRP domains of GLRaV-2 and BYV showed the presence of the eight conserved motifs 

of RdRP (Figure 3D). 

As shown in Figure 8, a tentative phylogenetic tree of the RdRP of GLRaV-2 
with respect to other closteroviruses shows that it is closely related to the monopartite 
closteroviruses BYV, BYSV, and CTV. 

In closteroviruses, a +1 ribosomal frameshift mechanism has been suggested 
to be involved in the expression of ORFlb as a large fusion protein with ORFla (Agranovsky 
et al., "Beet Yellows Closterovirus: Complete Genome Structure and Identification of a 
Papain-like Thiol Protease," Virology 198:31 1-24 (1994); Karasev et al., "Complete 
Sequence of the Citrus Tristeza Virus RNA Genome," Virology 208:51 1-20 (1995); Klaassen 
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et al., "Genome Structure and Phylogenetic Analysis of Lettuce Infectious Yellows Virus, a 
Whitefly-Transmitted, Bipartite Closterovirus," Virology 208:99-110 (1995); Karasev et al., 
"Organization of the 3'-Terminal Half of Beet Yellow Stunt Virus Genome and Implications 
for the Evolution of Closteroviruses," Virology 221:199-207 (1996); Jelkmann et al., 
"Complete Genome Structure and Phylogenetic Analysis of Little Cherry Virus, a Mealybug- 
Transmissible Closterovirus," J. General Virology 78:2067-71 (1997), all of which are 
hereby incorporated by reference). In the overlapping ORFla/lb region of BYV, the slippery 
sequence of GGGUUUA and two hairpins structure (stem-loop and pseudoknot) are believed 
to result in a +1 frameshift (Agranovsky et al., "Beet Yellows Closterovirus: Complete 
Genome Structure and Identification of a Papain-like Thiol Protease," Virology 198:3 1 1-24 
(1994), which is hereby incorporated by reference). None of these features are conserved in 
CTV and BYSV (Karasev et al., "Complete Sequence of the Citrus Tristeza Virus RNA 
Genome," Virology 208:51 1-20 (1995); Karasev et al., "Organization of the 3'-Terminal Half 
of Beet Yellow Stunt Virus Genome and Implications for the Evolution of Closteroviruses," 
Virology 221:199-207 (1996), both of which are hereby incorporated by reference), in which 
a ribosomal pausing at a terminator or at a rare codon was suggested to perform the same 
function. Comparisons of the nucleotide sequence of the C-terminal region of the helicase 
and the N-terminal region of RdRP of GLRaV-2 with the same region of other 
closteroviruses revealed a significant similarity to BYV, BYSV, and CTV. As shown in 
Figure 4, the terminator UAG at the end of C'-terminal helicase of GLRaV-2 aligned with the 
terminator UAG of BYV and BYSV, and arginine CGG codon of CTV. 

ORP2 encodes a small protein consisting of 1 71 bp (57 amino acid) with a 
molecular mass of 6,297 Da. As predicted, the deduced amino acid sequence includes a 
stretch of nonpolar amino acids, which is presumed to form a transmembrane helix. A small 
hydrophobic analogous protein is also present in BYV, BYSV, CTV, LIYV, and LChV 
(Agranovsky et al. "Nucleotide Sequence of the 3'-Terminal Half of Beet Yellows 
Closterovirus RNA Genome Unique Arrangement of Eight Virus Genes," J. General 
V'roloRV 72:15-24 (1991); Karasev et al., "Organization of the 3'-Terminal Half of Beet 
Yellow Stunt Virus Genome and Implications for the Evolution of Closteroviruses," Virology 
221 :199-207 (1996); Pappu et al., "Nucleotide Sequence and Organization of Eight 3' Open 
Reading Frames of the Citrus Tristeza Closterovirus Genome," Virology 199:35-46 (1994); 
Klaassen et al., "Partial Characterization of the Lettuce Infectious Yellows Virus Genomic 
RNAs, Identification of the Coat Protein Gene and Comparison of its Amino Acid Sequence 
With Those of Other Filamentous RNA Plant Viruses," J. General Virology 75:1525-33 
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(1994); Jelkmann et al., "Complete Genome Structure and Phylogenetic Analysis of Little 
Cherry Virus, a Mealybug-Transmissible Closterovirus," J. General Viroloev 78:2067-71 
(1997), all of which are hereby incorporated by reference). 

ORF3 encodes a 600 amino acid polypeptide with a molecular mass of 65,1 1 1 
Da, which is homologous to the HSP70 cellular heat shock protein. HSP70 is highly 
conserved among closteroviruses and is probably involved in ATPase activity and the protein 
to protein interaction for chaperone activity (Agranovsky et al. 'The Beet Yellows 
Closterovirus p65 Homologue of HSP70 Chaperones has ATPase Activity Associated with 
its Conserved N-ierminal Domain but Interact with Unfolded Protein Chains," J. General 
Virology 78:535-42 (1997); Agranovsky et al., "Bacterial Expression and Some Properties of 
the p65, a Homologue of Cell Heat Shock Protein HSP70 Encoded in RNA Genome of Beet 
Yellows Closterovirus." Dokladv Akademii Nauk. 340:416-1 8 (1995); Karasev et al., 
"HSP70-Re]ated 65-kDa Protein of Beet Yellows Closterovirus is a Microtubule-Binding 
Protein," FEBS Leners 304:12-14 (1992), all of which are hereby incorporated by reference). 
As shown in Figure 5, alignment of the complete ORF3 of GLRaV-2 with HSP70 homolog 
of BYV revealed the presence of the eight conserved motifs. The percentage similarity of the 
HSP70 between GLRaV-2 and that of BYV, BYSV, CTV, LIYV, and LChV is 47.8%, 
47.2%, 38.6%, 20.9%, and 17.7%, respectively. 

ORF4 encodes a 551 amino acid protein with a molecular mass of 63,349 Da. 
Database searching with the ORF4 protein product did not identify similar proteins except 
those of its counterparts in closteroviruses, BYV (P64), BYSV (P61), CTV (P61), LIYV 
(P59), and LChV (P61) . This protein is believed to be a putative heat shock 90 protein. As 
shown in Figure 9, two conserved motifs which were present in BYV (Agranovsky et al. 
"Nucleotide Sequence of the 3'-Terminal Half of Beet Yellows Closterovirus RNA Genome 
Unique Arrangement of Eight Virus Genes " .T. General Virology 72:15-24 (1991), which is 
hereby incorporated by reference) and CTV (Pappu et al., "Nucleotide Sequence and 
Organization of Eight 3' Open Reading Frames of the Citrus Tristeza Closterovirus Genome," 
Virology 199:35-46 (1994), which is hereby incorporated by reference) were also identified 
in the ORF4 ofGLRaV-2. 

ORF5 and 0RF6 encode polypeptides with molecular mass of 24,803 Da and 
21,661 Da, respectively. The start codon for both ORFs is in a favorable context for 
translation. 0RF6 was identified as the coat protein gene of GLRaV-2 based on the sequence 
comparison with other closteroviruses. The calculated molecular mass of the protein product 
of 0RF6 (21,662 Da) is in good agreement with the previously estimated 22-26 kDa based 
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on SDS-PAGE (Zimmermaiin et al., "Characterization and Serological Detection of Four 
Closterovirus-like Particles Associated with Leafroll Disease on Grapevine," J, 
Phytopathology 130:205.18 (1990); Boscia et al., "Nomenclature of Grapevine Leafroll- 
Associated Putative Closteroviruses." Vitis 34:171-75 (1995), both of which are hereby 
incorporated by reference). 

Database searching with the deduced amino acid sequence of the ORF6 of 
GLRaV-2 showed a similarity with the coat proteins of closteroviruses, BYV, BYSV, CTV, 
LIYV, LChV, and GLRaV-3. At the nucleotide level, the highest percentage similarity was 
with the coat protein of BYSV (34.8%); at the amino acid level, the highest percentage 
similarity was with the coat proteins of BYV (32.7%) and BYSV (32.7%). As shown in 
Figure 6A, alignment of the amino acid sequence of the coal protein and coat protein 
duplicate of GLRaV-2 with respect to other closteroviruses revealed that the invariant amino 
acid residues ( N. R. G. D.) were present in both ORF5 and 0RF6 of GLRaV-2. Two of 
these amino acid residues (R and D) are believed to be involved in stabilization of molecules 
by salt bridge formation and proper folding in the most conserved core region of coat proteins 
of all filamentous plant viruses (Dolja et al., "Phylogeny of Capsid Proteins of Rod-Shaped 
and Filamentous RNA Plant Viruses Two Families With Distinct Patterns of Sequence and 
Probably Structure Conservation," Virology 184:79-86 (1991), which is hereby incoiporated 
by reference). 

Identification of 0RF6 as the coat protein gene was further confirmed by 
Western blot following expression of a fusion protein, consisting of a 22 kDa of 0RF6 CP 
and a 42 kDa of maltose binding protein, produced by transformed E, coli as described in 
Example 5 supra. As shown in Figure 6B, the putative phylogenetic tree of the coat protein 
and coat protein duplicate of GLRaV-2 with those of other closteroviruses showed that 
GLRaV-2 is more closely related to aphid transmissible closteroviruses (BYV, BYSV, and 
CTV) (Candresse, "Closteroviruses and Clostero-like Elongated Plant Viruses," in 
Encvclopedia of Virology, pp. 242-48, Webster and Granoff, eds.. Academic Press, New 
York (1 994), which is hereby incoiporated by reference) than to whitefly (LIYV) or 
mealybug transmissible closteroviruses (LChV and GLRaV-3) (Raine et al., "Transmission 
of the Agent Causing Little Cherry Disease by the Apple Mealybug Phenacoccus aceris and 
the Dodder Cuscuta Lupuliformis"' Canadian J. Plant Pathology 8:6-1 1 (1986); Jelkmann et 
al., "Complete Genome Structure and Phylogenetic Analysis of Little Cherry Virus, a 
Mealybug-Transmissible Closterovirus," J. General Virology 78:2067-71 (1997); Rosciglione 
and Gugerli, "Transmission of Grapevine Leafroll Disease and an Associated Closterovirus 
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to Healthy Grapevine by the Mealybug Planococcusficus," Phytoparasitica 17:63 (1989); 
Engelbrecht and Kasdorf, "Transmission of Grapevine Leafroll Disease and Associated 
Closteroviruses by the Vine Mealybug planococcus-ficus," Phvtophlactica. 22:341-46 
(1990); Cabaleiro and Segura, 1997; Petersen and Charles, "Transmission of Grapevine 
LeafroU-Associated Closteroviruses by Pseudococciis longispinus and P. calceolariae. Plant 
Pathology 46:509-15 (1997), all of which are hereby incorporated by reference). 

ORF7 and ORF8 encode polypeptides of 162 amino acid with a molecular 
mass of 1 8,800 Da and of 206 amino acid with a molecular mass of 23,659 Da, respectively. 
Database searching with the 0RF7 and 0RF8 showed no significant similarity with any other 
proteins. Nevertheless, these genes were of similar in size and location as those observed in 
the sequence of other closteroviruses, BYV (P20, P21), BYSV (PI 8, P22), and LChV (P21, 
P27) (Figure 7). However, conserved regions were not observed between the 0RF7 or ORF8 
and its counterparts in BYV, BYSV, and LChV. 

The 3' terminal untranslated region (3'-UTR) consists of 216 nucleotides. 
Nucleotide sequence analysis revealed a long oligo(A) tract close to the end of the GLRaV-2 
genome which is similar to that observed in the genome of BYV and BYSV (Agranovsky et 
al. "Nucleotide Sequence of the 3'-Terminal Half of Beet Yellows Closterovirus RNA 
Genome Unique Arrangement of Eight Virus Genes," .1. General Virology 72:15-24 (1991); 
Karasev et al., "Organization of the 3'-Terminal Half of Beet Yellow Stunt Virus Genome 
and Implications for the Evolution of Closteroviruses," Virology 221 : 199-207 (1996), both of 
which are hereby incorporated by reference). The genome of BYV ends in CCC, BYSV, and 
CTV ends in CC with an additional G or A in the double-stranded replicative form of BYSV 
(Karasev et al., "Organization of the 3'-Terminal Half of Beet Yellow Stunt Virus Genome 
and Implications for the Evolution of Closteroviruses," Virology 221 : 199-207 (1996), which 
is hereby incorporated by reference) and CTV (Karasev et al., "Complete Sequence of the 
Citrus Tristeza Virus RNA Genome," Virology 208:51 1-20 (1995), which is hereby 
incorporated by reference), respectively . GLRaV-2 had CGC at the 3' terminus of the 
genome. Recently, a conserved 60 nt cis-element was identified in the 3'-UTR of three 
monopartite closteroviruses, which included a prominent conserved stem and loop structure 
(Karasev et al., 1996). As shown in Figure 10, alignment of the 3'-UTR sequence of 
GLRaV-2 with the same regions of BYV, BYSV, and CTV showed the presence of the same 
conserved 60 nt stretch. Besides this cis-element, conserved sequences were not found in the 
3' UTRs of GLRaV-2, BYV, BYSV, and CTV. 
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The closteroviruses studied so far (e.g., BYV, BYSV, CTV, LIYV, LChV, and 
GLRaV-3) have apparent similarities in genome organization, which include replication 
associated genes that consist of MT, HEL, and RdRP conserved domains and a five-gene 
array unique for closteroviruses (Dolja et al. "Molecular Biology and Evolution of 
5 Closteroviruses: Sophisticated Build-up of Large RNA Genomes," Annual Rev. 

Photopathology 32:261-85 (1994); Agranovsky "Principles of Molecular Organization, 
Expression, and Evolution of Closteroviruses: Over the Barriers," Adv. in Virus Res. 47:1 19- 
218 (1 996); Jelkmann et al., "Complete Genome Structure and Phylogenetic Analysis of 
Little Cherry Virus, a Mealybug-Transmissible Closterovirus " J. General Virologv 78:2067- 
10 71 (1 997); Ling et al., "Nucleotide Sequence of the 3' Terminal Two-Thirds of the Grapevine 
Leafroll Associated Virus-3 Genome Reveals a Typical Monopartite Closterovirus," L 
General Virology 79(5):1289-1301 (1998), all of which are hereby incorporated by 
reference). 

The above data clearly shows that GLRaV-2 is a closterovirus. In the genome 

1 5 of GLRaV-2, two putative papain-like proteases were identified and an autoproteolytic 

cleavage process was predicted. The replication associated proteins consisting of MT, HEL, 
and RdRP conserved motifs were also identified, which were phylogenetically closely related 
to the replication associated proteins of other closteroviruses. A unique gene array including 
a small hydrophobic transmembrane protein, HSP70 homolog, HSP90 homolog, diverged CP 

20 and CP was also preserved in GLRaV-2. In addition, the calculated molecular mass (21 ,661 
Da) of the coat protein (ORF6) of GLRaV-2 is in good agreement with that of the other 
closteroviruses (22 to 28 kDa) (Martelli and Bar-Joseph, "Closteroviruses: Classification and 
Nomenclature of Viruses," Fifth Report of the Internationa] Committee on Taxonomy of 
Viruses, Francki et al., eds., Springer-Verlag Wein, New York, p. 345-47 (1991); Candresse 

25 and Martelli, "Genus Closterovirus"" in Virus Taxonomy. Report of the International 

Committee on Taxonomy of Viruses . Murphy et al., eds., Springer-Verlag., NY, p. 461-63 
(1995), both of which are hereby incorporated by reference). Two ORFs downstream of the 
CP are of similar, in size and location, to those observed in the genome of BYV. 
Furthermore, lack of a poly(A) tail at the 3' end of GLRaV-2 is also in good agreement with 

30 other closteroviruses. Like all other closteroviruses, the expression of ORFlb is suspected to 
occur via a +1 ribosomal frameshift and the 3 'proximal ORFs are probably expressed via 
formation of a nested set of subgenomic RNAs. Since the slippery sequence, stem-loop and 
pseudoknot structure involved in the frameshift of BYV were absent in GLRaV-2, the +1 
frameshift of GLRaV-2 might be the same as proposed for CTV (Karasev et al., "Complete 
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Sequence of the Citrus Tristeza Virus RNA Genome," Virology 208:51 1-20 (1995), which is 
hereby incorporated by reference) and BYSV (Karasev et al., "Organization of the 3'- 
Terminal Half of Beet Yellow Stunt Virus Genome and Implications for the Evolution of 
Closteroviruses," Virology 221 : 199-207 (1 996), which is hereby incorporated by reference). 

Overall, GLRaV-2 is more closely related to monopartite closteroviruses 
BYV, BYSV, and CTV than to GLRaV-3 (Figure 7) (Ling et al., "Nucleotide Sequence of 
the 3' Terminal Two-Thirds of the Grapevine LeafroU Associated Virus-3 Genome Reveals a 
Typical Monopartite Closterovirus," 1 General Virology 79(5):1289-1301 (1998), which is 
hereby incorporated by reference), even though the latter causes similar leafroll symptoms in 
grapevine (Rosciglione and Gugerli, "Maladies de I'Enroulement et du Bois Strie de la Vigne: 
Analyse Microscopique et Serologique (Leafroll and Stem Pitting of Grapevine: 
Microscopical and Serological Analysis)," Rev Suisse Viticult Arboricult Horticulture 
18:207-1 1 (1986); Hu et al., "Characterization of Closterovirus-Like Particles Associated 
with Grapevine Leafroll Disease," .). Phytopathology 128:1-14 (1990), both of which are 
hereby incorporated by reference). 

Closteroviruses are a diverse group with complex and heterogeneous genome 
organizations. So far, GLRaV-2 is the only closterovirus that matches with the genome 
organization of BYV, the type member of the genus Closterovirus. In addition, the genomic 
RNA of GLRaV-2 is about the same size as that of BYV; however, the transmission vector of 
GLRaV-2 is unknown. The genome organization of GLRaV-2 is more closely related to the 
aphid transmissible closteroviruses (BYV and CTV) than to whitefly (LIYV) or mealybug 
transmissible closteroviruses (LChV and GLRaV-3). Thus, it is possible that GLRaV-2 is 
transmitted by aphids. Aphid transmission experiments with GLRaV-2 should provide 
information that might help develop methods for further control of GLRaV-2. 

A total of 15,500 nucleotides or over 95% of the estimated GLRaV-2 genome 
has been cloned and sequenced. GLRaV-2 and GLRaV-3 (Ling et al., "Nucleotide Sequence 
of the 3' Terminal Two-Thirds of the Grapevine Leafroll Associated Virus-3 Genome 
Reveals a Typical Monopartite Closterovirus," J. General Virology 79(5):1289-1301 (1998), 
which is hereby incorporated by reference) are the first grapevine leafroll associated 
closteroviruses that have been almost completely sequenced. The above data clearly justify 
the inclusion of GLRaV-2 into the genus Closterovirus. In addition, the information 
regarding the genome of GLRaV-2 would provide a better understanding of this and related 
GLRaVs, and add fundamental knowledge to the group of closteroviruses. 
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Example 7 - Construction of the CP Gene of GLRaV-2 in Plant Expression Vector 

GLRaV-2 infected Vitis vinifera . cv Pinot Noir grapevines originated from a 
vineyard in central New York was used as the virus isolate, from which the cp gene of 
5 GLRaV-2 was identified. Based on the sequence information, two oligonucleotide primers 
have been designed. The sense primer CP-96F (SEQ. ID. No. 21) starts from the ATG 
initiation codon of the coat protein gene and the complementary primer CP-96R (SEQ. ID. 
No. 22) starts from 56 nucleotides downstream of the stop codon of the CP gene. A Nco 1 
restriction site (1 1 bp in SEQ. ID. No. 21 and 13 bp in SEQ. ID. No. 22) is introduced in the 

10 begimiing of both primers to facilitate the cloning. The coat protein gene of GLRaV-2 was 
amplified from dsRNA extracted from GLRaV-2 infected grapevine using reverse 
transcriptase polymerase chain reaction (RT-PCR). The PCR-amplified CP product was 
purified from low melting temperature agarose gel, digested with Nco I and cloned into the 
same enzyme digested plant expression vector pEPT8 (shown at Figure 1 1). After screening, 

1 5 the orientation of recombinant construct was checked by using the internal restriction site of 
the CP gene and directly sequencing the CP gene. The recombinant construct with 
translatable (sense) full length coat protein gene, pEPT8CP-GLRaV2, was going through for 
the further cloning. The plant expression cassette, which consisted of a double cauliflower 
mosaic virus (CaMV) 35S-enhancer, a CaMV 35S-promoter, an alfalfa mosaic virus 

20 (ALMV) RNA4 5' leader sequence, a coat protein gene of GLRaV-2 (CP-GLRaV-2), and a 
CaMV 35S 3' untranslated region as a terminator, was cut using the EcoR 1 restriction 
enzyme, isolated from low melting point temperature agarose gel, and cloned into the same 
restriction enzyme treated binary vector pGA482GG or pGA482G (a derivative of pGA482 
(An et al., "Binary Vectors," in Plant Molecular Biology Manual , pp. A3: 1-1 9, Gelvin and 

25 Schilperoot, eds., Kinwer Academic Publishers, Dordrecht, Netherlands (1 988), which is 
hereby incorporated by reference). The resulting recombinants constructs are 
pGA482GG/EPT8CP'GLRaV2 (shown at Figure 1 1 A), which contain both neomycin 
phosphotransferase (npt II) and ^-glucuronidase (GUS) at the internal region of the T-DNA, 
and pGA482G/EPT8CP-GLRaV2 (shown at Figure 1 IB) without GUS. These recombinants 

30 constructs were separately introduced by electroporation into disarmed avirulent 

Agrobacterium tumefaciens strain C58Z707. The Agrobacrerium lumefaciens containing the 
vector was used to infect Nicotiana benthamiana wounded leaf disks according to the 
procedure essentially described by Horsch et al., "A Simple and General Method for 
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Transferring Genes into Plants," Science 277:1229-1231 (1985), which is incorporated herein 
by reference. 

Example 8 - Analysis of Transgenic Nicotiana bentliamiana Plants with the CP Gene of 
GLRaV-2 

NPT II-ELISA: Double-antibody sandwich enzyme linked immnuosorbent 
assay (DAS-ELISA) was used to detect the npt II enzyme with an NPT II-ELISA kit (5' 
prime to 3' prime. Inc., Boulder, Co.). 

Indirect ELISA: Polyclonal antibodies to GLRaV-2, which were prepared 
from the coat protein expressed in E, coli, were used. Plates were coated with homogenized 
samples in extraction buffer (1:10, w/v) (phosphate buffered saline containing 0.05% Tween 
20 and 2% polyvinyl pyrrolidone) and incubated overnight at 4''C. After washing with 
phosphate buffered saline containing 0.05% Tween 20 (PBST), the plates were blocked with 
blocking buffer (phosphate buffered saline containing 2% BSA) and incubated at room 
temperature for 1 hr. The anti-GLRaV-2 IgG was added at 2 |ig/ml after washing with 
PBST. After incubation at 30 C for 4 hr, the plates were washed with PBST, and the goat 
anti-rabbit IgG conjugate of alkaline phosphotase (Sigma) was added at 1 : 10,000 dilution. 
The absorbance was measured at 405 nm with a MicroELISA AutoReader. In addition, 
Western blot was also performed according to the method described by Hu et al., 
"Characterization of Closterovirus-like Particle Associated Grapevine Leafroll Disease," J. 
PhvtophatholoRY 128:1-14, (1990), which is incorporated herein by reference. 

PCR analysis: Genomic DNA was extracted from leaves of putative 
transgenic and non-transgenic plants according to the method described by Cheung et al., "A 
Simple and Rapid DNA Microextraction Method for Plants, Animal, and Insect Suitable for 
RAPD and other PCR analysis," PCR Methods and Applications 3:69 (1996), which is 
incorporated herein by reference. The extracted total DNA served as the template for PCR 
reaction. The primers CP-96F and CP-96R (SEQ. ID. Nos. 21 and 22, respectively) for the 
CP gene of GLRaV-2, as well as npt II 5'- and 3'- primers were used for PCR analysis. PCR 
reaction was performed at the 94°C x 3 min for one cycle, followed by 30 cycles of 94° C x 
1 min, 50"* C X 1 min, and 72° C x 2:30 min with an additional extension at 72° C for 10 min. 
The PCR product was analyzed on agarose gel. 

After transformation, a total of 42 kanamycin resistant Nicotiana benthamiana 
lines (Ro) were obtained, of which the leaf samples were tested by NPT II enzyme activity. 
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Among them, 37 lines were NPT II positive by ELISA, which took about 88.0% of total 
transformants. However, some of NPT II negative plants were obtained among these selected 
kanamycin resistant plants. All of the transgenic plants were self-pollinated in a greenhouse, 
and the seeds from these transgenic lines were germinated for further analysis. 
5 The production of GLRaV-2 CP in transgenic plants was detected by indirect 

ELISA prior to inoculation, and the results showed that GLRaV-2 CP gene expression was 
not detectable in all transgenic plants tested. This result was further confirmed with Western 
blot. Using the antibody to GLRaV-2, the production of the CP was not detected in the 
transgenic and nontransgenic control plants. However, a protein of expected size (-22 kDa) 

10 was detected in GLRaV-2 infected positive control plants. This result was consistent vyith the 
ELISA result. The presence of the CP gene of GLRaV-2 in transgenic plants was detected 
from total genomic DNA extracted from plants tissue by PCR analysis (Figure 12). The DNA 
product of expected size (653 bp) was amplified from twenty tested transgenic lines, but not 
in non-transgenic plants. The result indicated that the CP gene of GLRaV-2 was present at 

15 these transgenic lines, which was also confirmed by Northern blot analysis. 

Example 9 - Ri and R2 transgenic Nicotiana benihamiana Plants Are Resistant to 
GLRaV-2 

20 Inoculation of transgenic plants: GLRaV-2 isolate 94/970, which was 

originally identified and transmitted from grapevine to Nicotiana benihamiana in South 
Africa (Goszczynski et al., "Detection of Two Strains of Grapevine Leafroll-Associated 
Virus 2," Vitis 35:133-35 (1996), which is incorporated herein by reference), was used as 
inoculum. The CP gene of isolate 94/970 was sequenced; and it is identical to the CP gene 

25 used in construction. Nicotiana benthamiana is an experimental host of GLRaV-2. The 

infection on it produces chlorotic and occasional necrotic lesions followed by systemic vein 
clearing. The vein clearing results in vein necrosis. Eventually the infected plants died, 
starting from the top to the bottom. 

At five to seven leaf stage, two youngest apical leaves were challenged with 

30 GLRaV-2 isolate 94/970. Inoculum was prepared by grinding 1 .0 g GLRaV-2 infected 

Nicotiana benthamiana leaf tissue in 5 ml of phosphate buffer (O.OIM K2HP04, PH7.0). The 
tested plants were dusted with carborundum and rubbed with the prepared inoculum. Non- 
transformed plants were simultaneously inoculated as above. The plants were observed for 
symptom development every other day for 60 days after inoculation. Resistant Rl transgenic 



wo 98/53055 



-75- 



PCTAJS98/10313 



plants were carried on to R2 generation for further evaluation. 

Transgenic progenies from 20 Rq lines were initially screened for the 
resistance to GLRaV-2 followed by inoculation with GLRaV-2 isolate 94/970. The seedlings 
of the transgenic plants (NPT II positive), and nontransformed control plants were inoculated 
5 with GLRaV"2. After inoculation, the reaction of tested plants were divided into tlu-ee types: 
highly susceptible (i.e. typical symptoms were observed two to four weeks postinoculation); 
tolerant (i.e. no symptom was developed in the early stage and typical symptoms was shown 
four to eight weeks postinoculation); and resistant (i.e. the plants remained asymptomatic 
eight weeks postinoculation). Based on the plant reaction, the resistant plants were obtained 

10 from fourteen different lines (listed in Table 1 below). In each of these fourteen lines, there 
was no virus detected within these plants by ELISA at 6 weeks postinoculation. In contrast, 
GLRaV-2 was detected in symptomatic plants by indirect ELISA. In the other six lines, 
although there were a few plants with some kind of delay in symptom development, all the 
inoculated transgenic plants died at three to eight weeks postinoculation. Based on the initial 

1 5 screening results, five representative lines consisting of tliree resistant lines (1,4, and 19) and 
two susceptible lines (12 and 13) were selected for the further analysis. 



Table 1 








Reaction of Tested Plants 




No. Line 


No. 


HS 


T 


HR 


line 1 


39 


14 


3 


22 


line 2 


36 


7 


6 


23 


line 3 


38 


11 


4 


23 


line 4 


31 


4 


5 


22 


line 5 


33 


6 


13 


14 


line 6 


36 


4 


16 


16 


line 7 


32 


5 


9 


18 


line 8 


37 


22 


9 


6 


line 9 


36 


9 


12 


15 


line 10 


14 


13 


1 


0 


line 1 1 


13 


11 


2 


0 


line 12 


17 


16 


1 


0 


line 13 


16 


14 


0 


0 


line 14 


17 


17 


0 


0 


line 15 


32 


30 


2 


0 


line 16 


33 


6 


13 


14 


line 17 


12 


0 


1 


1 1 


line 19 


15 


0 


0 


15 


line 20 


19 


3 


0 


16 


line 21 


14 


1 


3 


10 


control 


15 


15 


0 


0 
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Table 1 

Reaction of Tested Plants 

No. Line No HS T HR_ 

No Line: include tiansgenic lines and nontransformed control; 
No: the number of transgenic and nontransformed plants; 

HS: highly susceptible, typical symptoms were observed two to four weeks after 
inoculation; 

T: tolerant, the. symptoms were observed five to eight weeks after inoculation; and 
HR: plants remain without asymptoms after eight weeks inoculation. 



Table 2 below shows the symptom development in transgenic plants relative 
to non-transgenic control plants in the five selected lines in separate experiments. Non- 
5 transgenic control plants were all infected two to four weeks after inoculation, which showed 
typical GLRaV-2 symptoms on Nicotiana benthomiana, including chlorotic and local lesions 
followed by systemic vein clearing and vein necrosis on the leaves. Tliree of the tested lines 
(1, 4, and 19) showed some resistance that was manifested by either an absence or a delay in 
symptom development. Two other lines, 12 and 13, developed symptoms at nearly the same 

1 0 time as the non-transformed control plants. From top to bottom, the leaves of infected plants 
gradually became yellow, wilted, and dried, and, eventually, the whole plants died. No 
matter when infection occurred, the eventual result was the same. Six weeks after 
inoculation, all non-transgenic plants and the susceptible plants were dead. Some tolerant 
plants started to die. In contrast, the asymptomatic plants were flowering normally and 

15 pollinating as the non-inoculated healthy control plants (Figure 13). 



Table 2 



No. Line 


No. 


HS 


Reaction of Tested Plants 
T 


HR 


line 1 


19 


5 


6 


8 


line 4 


15 


9 


1 


5 


line 12 


16 


14 


2 


0 


line 13 


18 


13 


5 


0 


line 19 


13 


10 


0 




non-transgenic 


24 


23 


1 


0 



No. Line: incude transgenic lines and nontransformed control; 

No.: Number of transgenic and nontransformed plants tested; 

HS: highly susceptible; typical symptoms were observed two to four weeks after 

inoculation; 

T: tolerant, the symptoms were observed five to eight weeks postinoculation; and 
HR: plants remain without asymptoms after eight weeks inoculation. 



ELISA was performed at 6 weeks postinoculation to test the GLRaV-2 
replication in the plants. Presumably, the increased level of CP reflected virus replication. 
20 The result showed that the absorbance value in symptomatic plants reached (OD) 0.7 to 3.2, 
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compared to (OD) 0.10-0.13 prior to inoculation. In contrast, GLRaV-2 was not detected in 
asymptomatic plants, of which the absorbance value was the same or nearly the same as that 
of healthy nontransformed control plants. The data confirmed that virus replicated in 
symptomatic plants, but not in asymptomatic plants. The replication of GLRaV-2 was 

5 suppressed in asymptomatic plants. This result implicated that another mechanism other than 
the CP-mediated resistance was probably involved. 

Three R2 progenies derived from transgenic resistant plants of lines 1, 4, and 
19 were generated and utilized to examine the stable transmission and whether resistance was 
maintained in R2 generation. These results are shown in Table 3 below. NPT II analysis 

10 revealed that R2 progeny were still segregating. The CP expression in R2 progeny was still 
undetectable. After inoculation, all the nontransgenic plants were infected and showed 
GLRaV-2 symptoms on the leaves after 24 days postinoculation. In contrast, the inoculated 
transgenic R2 progeny showed different levels of resistance from those highly susceptible to 
highly resistant. The tolerant and resistant plants were manifested by a delay in symptom 

15 development and absence of symptoms, respectively. At 6 weeks postinoculation, GLRaV-2 
was detected in the tolerant symptomatic infected plants by indirect ELISA; but not in 
asymptomatic plants. This result indicated that virus replication was suppressed in these 
resistant plants, which was confirmed by Western blot. These resistant plants remained 
asymptomatic eight weeks postinoculation, and they were flowering normally and 

20 pollinating. 



Table 3 







NPT 11 




Reaction of Tested Plants 


'No. Line 


No. Plants 


positive/negative 


HS 


T 


HR 


line 1/22 


12 


12/20 


3 


:> 


6 


line 1/30 


11 


8/3 


7 


2 


2 


line 1/31 


11 


10/1 


6 


3 


2 


line 1/35 


10 


10/0 


4 


6 


0 


line 1/41 


8 


7/1 


2 


2 


4 


line 4/139 


12 


11/1 


4 


4 


3 


line 4/149 


10 


7/3 


4 


5 


1 


line 4/152 


10 


8/2 


9 


0 


1 


line 4/174 


9 


8/1 


4 


0 


4 


line 19/650 


11 


10/1 


7 


0 


2 


line 19/657 


12 


12/0 


6 


2 


4 


line 19/659 


12 


8/4 


5 


2 


5 


line 19/660 


10 


8/2 


3 


6 


1 


n on -trans formed 


12 


0/12 


12 


0 


0 



CK 

HS: highly susceptible, typical symptoms were observed two to four weeks after inoculation; 

T: tolerant, the symptoms were observed five to eight weeks postinoculation; and 

HR: plants remain asymptomatic at eight weeks postinoculation, 
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Example 10 - Evidence for RNA-Mediated Protection in Transgenic Plants 

Northern blot analysis: Total RNA was extracted from leaves prior to 
inoculation follov^ing the method described by Napoli et al., Plant Cell 2:279-89 (1990), 
5 which is hereby incorporatd by reference. The concentration of the extracted RNA was 
measured by spectrophotometer at OD 260. About 10 g of total RNA was used for each 
sample. The probe used was the 3' one third of GLRaV-2 CP gene, which was randomly 
labeled with ^^P (a-dATP) using Klenow fragment of DNA polymerase L 

Using a DNA corresponding to the 3' one third CP gene sequence as probe, a 

1 0 single band was detected in the RNA extracted from susceptible plants from R] progeny of 
lines 5, 12, and 13 by Northern hybridization. There was little or no signal detected in the 
transgenic plants from Rl progeny of line 1,4, and 19. This RNA is not present in 
nontransformed control plants. The size of the hybridization signal was estimated to an 
approximately 0.9 kb nucleic acid, which was about the same as estimated (Figure 14). In 

1 5 lines of 1 , 4, and 1 9, the steady state level of PIN A expression was also low in R2 progeny. 
This data showed that susceptible plants from lines 12 and 13 had high niRNA level and all 
transgenic plants from lines 1, 4, and 19 had low mRNA level. 

Example 11 - Transformation and Analysis of Transgenic Grapevines with the CP 
20 Geneof GLRaV-2 

Plant materials: The rootstock cultivars Couderc 3309 (3309C) (V, ripariax V. 
rupestris), Vitis riparia 'Gloire de Montpellier' (Gloire), Teleki 5C (5C) (K berlandieri x V. 
riparia\ Millardet et De Grasset 101-14 (101-14 MGT) (K ripariax V. rupestris), and 

25 Richter 1 10 (1 1 OR) {V, rupestris x V. berlandieri) were utilized. Initial embryogenic calli of 
Gloire were provided by Mozsar and Siile (Plant Protection Institute, Hungarian Academy of 
Science, Budapest). All other plant materials came from a vineyard at the New York State 
Agricultural Experiment Station, Geneva, NY. Buds were removed from the clusters and 
surface sterilized in 70% ethanol for 1-2 min. The buds (from the greenhouse and the field) 

30 were transferred to 1% sodium hypochlorite for 15 min, then rinsed three times in sterile 
double-distilled water. Anthers were excised aseptically from flower buds with the aid of a 
stereo microscope. The pollen was crushed on a microscope slide under a coverslip with a 
drop of acetocarmine to observe the cytological stage. This was done to determine which 
stage was most favorable for callus induction. 
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Somatic embryogenesis and regeneration: Anthers were plated under aseptic 
conditions at a density of 40 to 50 per 9 cm diameter Petri dish containing MSE. Plates were 
cultured at 28^C in the dark. Callus was initiated, and, after 60 days, embryos were induced 
and were transferred to hormone-free HMG medium for differentiation. Torpedo stage 
embryos were then transferred from HMG to MGC medium to promote embryo germination. 
Cultures were maintained in the dark at 26-28°C and transferred to fresh medium at 3-4 week 
intervals. Elongated embryos were transferred to rooting medium in baby food jars (5-8 
embryos per jar). The embryos were grown in a tissue culture room at 25°C with a daily 16 h 
photoperiod (76 :mol. s) to induce shoot and root formation. After plants developed roots, 
they were transplanted to soil in the greenliouse. 

Transformation: The protocols used for transformation were modified from 
those described by Scorza et.al., "Transformation of Grape (Viiis vinifera L) Zygotic-derived 
Somatic Embryos and Regeneration of Transgenic Plants," Plant Cell Rpt. 14:589-92 (1995), 
which is hereby incorporated by reference. Overnight cultures of Agrobacterium strain 
C58Z707 or LBA4404 were grown in LB medium at 28°C in a shaking incubator. Bacteria 
were centrifuged for 5 min at 3000-5000 rpm and resuspended in MS liquid medium (OD 1 .0 
at A600 nm ). Calli with embryos were immersed in the bacterial suspension for 1 5-30 min, 
blotted dry, and transferred to HMG medium with or without acetosyringone (100 i-iM). 
Embryogenic calli were co-cultivated with the bacteria for 48 h in the dark at 28*'C. Then, 
the plant material was washed in MS liquid plus cefotaxime (300 mg/ml) and carbenicillin 
(200 mg/ml) 2-3 times. To select transgenic embi70s, the material was transferred to HMG 
medium containing either 20 or 40 mg/L kanamycin, 300 mg/L cefotaxime, and 200 mg/L 
carbenicillin. Alternatively, after co-cultivation, embryogenic calli were transferred to 
initiation MSE medium containing 25 mg/1 kanamycin plus the same antibiotics listed above. 
All plant materials were incubated in continuous dark at 28°C. After growth on selection 
medium for 3 months, embryos were transferred to HMG or MGC without kanamycin to 
promote elongation of embryos. They were then transferred to rooting medium without 
antibiotics. Nontransformed calli were grown on the same media with and without 
kanamycin to verify the efficiency of the kanamycin selection process. 

Although the invention has been described in detail for the purpose of 
illustration, it is understood that such detail is solely for that purpose, and variations can be 
made therein by those skilled in the art without departing from the spirit and scope of the 
invention which is defined by the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Cornell Research Foundation, Inc. 

(ii) TITLE OF IIWENTION: GRAPEVINE LEAFROLL VIRUS TYPE 2 PROTEINS 

AND THEIR USES 

(iii) NUMBER OF SEQUENCES: 23 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Nixon, Hargrave, Devans & Doyle LLP 

(B) STREET: Clinton Square, P.O. Box 1051 

(C) CITY: Rochester 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 14603 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/047,194 

(B) FILING DATE: 20-MAY-1997 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Goldman, Michael L. 

(B) REGISTRATION IsFUMBER : 30,727 

(C) REFERENCE/DOCKET NUMBER: 19603/1632 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (716) 263-1304 

(B) TELEFAX: (716) 263-1600 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : - 

TAAACATTGC GAGAGJ^CCC CATTAGCGTC TCCGGGGTGA ACTTGGGAAG GTCTGCCGCC 6 0 

GCTCAGGTTA TTTATTTCGG CAGTTTCACG CAGCCCTTCG CGTTGTATCC GCGCCAAGAG 12 0 

AGCGCGATCG TAAAAACGCA ACTTCCACCG GTCAGTGTAG TGAAGGTGGA GTGCGTAGCT 180 

GCGGAGGTAG CTCCCGACAG GGGCGTGGTC GACAAGAAAC CTACGTCTGT TGGCGTTCCC 24 0 

CCGCAGCGCG GTGTGCTTTC TTTTCCGACG GTGGTTCGGA ACCGCGGCGA CGTGATAATC 3 00 

ACAGGGGTGG TGCATGAAGC CCTGAAGAAA ATTAAAGACG GGCTCTTACG CTTCCGCGTA 360 

GGCGGTGACA TGCGTTTTTC GAGATTTTTC TCATCGAACT ACGGCTGCAG ATTCGTCGCG 420 

AGCGTGCGTA CGAACACTAC AGTTTGGCTA AATTGCACGA AAGCGAGTGG TGAGAJU^TTC 4 80 

TCACTCGCCG CCGCGTGCAC GGCGGATTAC GTGGCGATGC TGCGTTATGT GTGTGGCGGG 54 0 

AAATTTCCAC TCGTCCTCAT GAGTAGAGTT ATTTACCCGG ATGGGCGCTG TTACTTGGCC 6 00 

CATATGAGGT ATTTGTGCGC CTTTTACTGT CGCCCGTTTA GAGAGTCGGA TTATGCCCTC 66 0 

GGAATGTGGC CTACGGTGGC GCGTCTCAGG GCATGCGTTG AGAAGAACTT CGGTGTCGAA 72 0 

GCTTGTGGCA TAGCTCTTCG TGGCTATTAC ACCTCTCGCA ATGTTTATCA CTGTGATTAT 780 

GACTCTGCTT ATGTAAAATA TTTTAGAAAC CTTTCCGGCC GCATTGGCGG TGGTTCGTTC 84 0 

GATCCGACAT CTTTAACCTC CGTAATAACG GTGAAGATTA GCGGTCTTCC AGGTGGTCTT 900 

CCTAAAAATA TAGCGTTTGG TGCCTTCCTG TGCGATATAC GTTACGTCGA ACCGGTAGAC 96 0 

TCGGGCGGCA TTCAATCGAG CGTTAAGACG AAACGTGAAG ATGCGCACCG AACCGTAGAG 102 0 

GAACGGGCGG CCGGCGGATC CGTCGAGCAA CCGCGACAAA AGAGGATAGA TGAGAAAGGT 108 0 

TGCGGCAGAG TTCCTAGTGG AGGTTTTTCG CATCTCCTGG TCGGCAACCT TAACGAAGTT 114 0 

AGGAGGAAGG TAGCTGCCGG ACTTCTACGC TTTCGCGTTG GCGGTGATAT GGATTTTCAT 1200 

CGCTCGTTCT CCACCCAAGC GGGCCACCGC TTGCTGGTGT GGCGCCGCTC GAGCCGGAGC 12 6 0 

GTGTGCCTTG AACTTTACTC ACCATCTAAA AACTTTTTGC GTTACGATGT CTTGCCCTGT 13 2 0 

TCTGGAGACT ATGCAGCGAT GTTTTCTTTC GCGGCGGGCG GCCGTTTCCC TTTAGTTTTG 13 8 0 

ATGACTAGAA TTAGATACCC GAACGGGTTT TGTTACTTGG CTCACTGCCG GTACGCGTGC 14 4 0 

GCGTTTCTCT TAAGGGGTTT TGATCCGAAG CGTTTCGACA TCGGTGCTTT CCCCACCGCG 1500 

GCCAAGCTCA GAAACCGTAT GGTTTCGGAG CTTGGTGAAA GAAGTTTAGG TTTGAACTTG 1560 

TACGGCGCAT ATACGTCACG CGGCGTCTTT CACTGCGATT ATGACGCTAA GTTTATAAAG 162 0 
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1800 
1860 



GATTTGCGTC TTATGTCAGC AGTTATAGCT GGAAAGGACG GGGTGGAAGA GGTGGTACCT 1680 

TCTGACATAA CTCCTGCCAT Gi=^GCAGAAA ACGATCGAAG CCGTGTATGA TAGATTATAT 174 0 
GGCGGCACTG ACTCGTTGCT GAAACTGAGC ATCGAGJ\AAG ACTTAATCGA TTTCAAAAAT 
GACGTGCAGA GTTTGAAGAA AGATCGGCCG ATTGTCT^G TGCCCTTTTA CATGTCGGAA 

GCAACACAGA ATTCGCTGAC GCGTTTCTAC CCTCAGTTCG AACTTAAGTT TTCGCACTCC 1920 

TCGCATTCAG ATCATCCCGC CGCCGCCGCT TCTAGACTGC TGGAAAATGA AACGTTAGTG 1980 

CGCTTATGTG GT7VATAGCGT TTCAGATATT GGAGGTTGTC CTCTTTTCCA TTTGCATTCC 2 04 0 

AAGACGCAAA GACGGGTTCA CGTATGTAGG CCTGTGTTGG ATGGCAAGGA TGCGCAGCGT 2100 

CGCGTGGTGC GTGATTTGCA GTATTCCAAC GTGCGTTTGG GAGACGATGA TAAAATTTTG 2160 

GAAGGGCCAC GCAATATCGA CATTTGCCAC TATCCTCTGG GCGCGTGTGA CCACGAAAGT 2220 

AGTGCTATGA TGATGGTGCA GGTGTATGAC GCGTCCCTTT ATGAGATATG TGGCGCCATG 22 8 0 

ATCAAGAAGA AAAGCCGCAT AACGTACTTA ACCATGGTCA CGCCCGGCGA GTTTCTTGAC 234 0 

GGACGCGAAT GCGTCTACAT GGAGTCGTTA GACTGTGAGA TTGAAGTTGA TGTGCACGCG 24 00 

GACGTCGTAA TGTACAAATT CGGTAGTTCT TGCTATTCGC ACAAGCTTTC AATCATCAAG 2460 

GACATCATGA CCACTCCGTA CTTGACACTA GGTGGTTTTC TATTCAGCGT GGAGATGTAT 2 52 0 

GAGGTGCGTA TGGGCGTGAA TTACTTCAAG ATTACGAAGT CCGAAGTATC GCCTAGCATT 2 580 

AGCTGCACCA AGCTCCTGAG ATACCGAAGA GCTAATAGTG ACGTGGTTAA AGTTAAACTT 2 64 0 

CCACGTTTCG ATAAGAAACG TCGCATGTGT CTGCCTGGGT ATGACACCAT ATACCTAGAT 2 700 

TCGT^GTTTG TGAGTCGCGT TTTCGATTAT GTCGTGTGTA ATTGCTCTGC CGTGAACTCA 2 760 

AAAACTTTCG AGTGGGTGTG GAGTTTCATT AAGTCTAGTA AGTCGAGGGT GATTATTAGC 2 82 0 

GGTAAAATAA TTCACAAGGA TGTGAATTTG GACCTCAAGT ACGTCGAGAG TTTCGCCGCG 2 880 

GTTATGTTGG CCTCTGGCGT GCGCAGTAGA CTAGCGTCCG AGTACCTTGC TAAGAACCTT 2 94 0 

AGTCATTTTT CGGGAGATTG CTCCTTTATT GAAGCCACGT CTTTCGTGTT GCGTGAGAAA 3 000 

ATCAGAAACA TGACTCTGAA TTTTAACGAA AGACTTTTAC AGTTAGTGAA GCGCGTTGCC 3 06 0 

TTTGCGACCT TGGACGTGAG TTTTCTAGAT TTAGATTCAA CTCTTGAATC AATAACTGAT 312 0 

TTTGCCGAGT GTAAGGTAGC GATTGAACTC GACGAGTTGG GTTGCTTGAG AGCGGAGGCC 3180 

GAGAATGAAA AAATCAGGAA TCTGGCGGGA GATTCGATTG CGGCTTU^ACT CGCGAGCGAG 324 0 

ATAGTGGTCG ATATTGACTC TAAGCCTTCA CCGAAGCAGG TGGGTAATTC GTCATCCGAA 33 00 
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AACGCCGATA AGCGGGAAGT TCAGAGGCCC GGTTTGCGTG GTGGTTCTAG AAACGGGGTT 3360 

GTTGGGGAGT TCCTTCACTT CGTCGTGGAT TCTGCCTTGC GTCTTTTCAA ATACGCGACG 3 420 

GATCAACAAC GGATCAAGTC TTACGTGCGT TTCTTGGACT CGGCGGTCTC ATTCTTGGAT 34 80 

TACAACTACG ATAATCTATC GTTTATACTG CGAGTGCTTT CGGAAGGTTA TTCGTGTATG 3 540 

TTCGCGTTTT TGGCGAATCG CGGCGACTTA TCTAGTCGTG TCCGTAGCGC GGTGTGTGCT 3 6 00 

GTGAAAGAAG TTGCTACCTC ATGCGCGAAC GCGAGCGTTT CTAAAGCCAA GGTTATGATT 3660 

ACCTTCGCAG CGGCCGTGTG TGCTATGATG TTTAATAGCT GCGGTTTTTC AGGCGACGGT 3 720 

CGGGAGTATA AATCGTATAT ACATCGTTAC ACGCAAGTAT TGTTTGACAC TATCTTTTTT 3 7 80 

GAGGACAGCA GTTACCTACC CATAGAAGTT CTGAGTTCGG CGATATGCGG TGCTATCGTC 3 84 0 

ACACTTTTCT CCTCGGGCTC GTCCATAAGT TT7VAACGCCT TCTTACTTCA AATTACCAAA 3 900 

GGATTCTCCC TAGAGGTTGT CGTCCGGAAT GTTGTGCGAG TCACGCATGG TTTGAGCACC 3 96 0 

ACAGCGACCG ACGGCGTCAT ACGTGGGGTT TTCTCCCAAA TTGTGTCTCA CTTACTTGTT 4 02 0 

GGAAATACGG GTAATGTGGC TTACCAGTCA GCTTTCATTG CCGGGGTGGT GCCTCTTTTA 4 08 0 

GTTAAAAAGT GTGTGAGCTT AATCTTCATC TTGCGTGAAG ATACTTATTC CGGTTTTATT 414 0 

AAGCACGGAA TCAGTGAATT CTCTTTCCTT AGTAGTATTC TGAAGTTCTT GAAGGGTAAG 4 2 00 

CTTGTGGACG AGTTGAAATC GATTATTCAA GGGGTTTTTG ATTCCAACAA GCACGTGTTT 426 0 

AAAGAAGCTA CTCAGGAAGC GATTCGTACG ACGGTCATGC AAGTGCCTGT CGCTGTAGTG 432 0 

GATGCCCTTA AGAGCGCCGC GGGAAAAATT TATAACAATT TTACTAGTCG ACGTACCTTT 4 380 

GGTAAGGATG AAGGCTCCTC TAGCGACGGC GCATGTGAAG AGTATTTCTC ATGCGACGAA 4 44 0 

GGTGAAGGTC CGGGTCTGAA AGGGGGTTCC AGCTATGGCT TCTCAATTTT AGCGTTCTTT 4 500 

TCACGCATTA TGTGGGGAGC TCGTCGGCTT ATTGTTAAGG TGAAGCATGA GTGTTTTGGG 4 56 0 

AAACTTTTTG AATTTCTATC GCTCTlAGCTT CACGAATTCA GGACTCGCGT TTTTGGGAAG 4 620 

AATAGAACGG ACGTGGGAGT TTACGATTTT TTGCCCACGG GCATCGTGGA AACGCTCTCA 4 680 

TCGATAGAAG AGTGCGACCA AATTGAAGAA CTTCTCGGCG ACGACCTGAA AGGTGACAAG 4 74 0 

GATGCTTCGT TGACCGATAT GAATTACTTT GAGTTCTCAG TVAGACTTCTT AGCCTCTATC 4 8 00 

GAGGAGCCGC CTTTCGCTGG ATTGCGAGGA GGTAGCAAGA ACATCGCGAT TTTGGCGATT 4 8 60 

TTGGAATACG CGCATAATTT GTTTCGCATT GTCGCAAGCA AGTGTTCGAA ACGACCTTTA 4 920 

TTTCTTGCTT TCGCCGAACT CTCAAGCGCC CTTATCGAGA 7VATTTAAGGA GGTTTTCCCT 4 98 0 

CGTAAGAGCC AGCTCGTCGC TATCGTGCGC GAGTATACTC AGAGATTCCT CCGAAGTCGC 5 04 0 
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ATGCGTGCGT TGGGTTTGAA TAACGAGTTC GTGGTAJlAAT CTTTCGCCGA TTTGCTACCC 5100 

GCATTAATGA AGCGGAAGGT TTCAGGTTCG TTCTTAGCTA GTGTTTATCG CCCACTTAGA 516 0 

GGTTTCTCAT ATATGTGTGT TTCAGCGGAG CGACGTGAAA AGTTTTTTGC TCTCGTGTGT 522 0 

TTAATCGGGT TAAGTCTCCC TTTCTTCGTG CGCATCGTAG GAGCGAAAGC GTGCGAAGAA 52 8 0 

CTCGTGTCCT CAGCGCGTCG CTTTTATGAG CGTATTAAAA TTTTTCTAAG GCAGAAGTAT 5 34 0 

GTCTCTCTTT CTAATTTCTT TTGTCACTTG TTTAGCTCTG ACGTTGATGA CAGTTCCGCA 54 0 0 

TCTGCAGGGT TGAAAGGTGG TGCGTCGCGA ATGACGCTCT TCCACCTTCT GGTTCGCCTT 54 6 0 

GCTAGTGCCC TCCTATCGTT AGGGTGGGAA GGGTTAAAGC TACTCTTATC GCACCACAAC 5 52 0 

TTGTTATTTT TGTGTTTTGC ATTGGTTGAC GATGTGAACG TCCTTATCAA AGTTCTTGGG 5 58 0 

GGTCTTTCTT TCTTTGTGCA ACCAATCTTT TCCTTGTTTG CGGCGATGCT TCTACAACCG 564 0 

GACAGGTTTG TGGAGTATTC CGAGAAACTT GTTACAGCGT TTGAATTTTT CTTAAAATGT 5 7 00 

TCGCCTCGCG CGCCTGCACT ACTCAAAGGG TTTTTTGAGT GCGTGGCGAA CAGCACTGTG 5 76 0 

TCAAAAACCG TTCGAAGACT TCTTCGCTGT TTCGTGAAGA TGCTCAAACT TCGAAAAGGG 5 82 0 

CGAGGGTTGC GTGCGGATGG TAGGGGTCTC CATCGGCAGA AAGCCGTACC CGTCATACCT 5 88 0 

TCTAATCGGG TCGTGACCGA CGGGGTTGAA AGACTTTCGG TAAAGATGCA AGGAGTTGAA 5 94 0 

GCGTTGCGTA CCGAATTGAG AATCTTAGAA GATTTAGATT CTGCCGTGAT CGAAAAACTC 6 00 0 

AATAGACGCA GAAATCGTGA CACTAATGAC GACGAATTTA CGCGCCCTGC TCATGAGCAG 6 06 0 

ATGCAAGAAG TCACCACTTT CTGTTCGAAA GCCAACTCTG CTGGTTTGGC CCTGGAAAGG 612 0 

GCAGTGCTTG TGGAAGACGC TATAAAGTCG GAGAAACTTT CTAAGACGGT TAATGAGATG 618 0 

GTGAGGAAAG GGAGTACCAC CAGCGAAGAA GTGGCCGTCG CTTTGTCGGA CGATGAAGCC 6 24 0 

GTGGAAGAAA TCTCTGTTGC TGACGAGCGA GACGATTCGC CTAAGACAGT CAGGATAAGC 6 3 00 

GAATACCTAA ATAGGTTAAA CTCAAGCTTC GAATTCCCGA AGCCTATTGT TGTGGACGAC 6 36 0 

AACAAGGATA CCGGGGGTCT AACGAACGCC GTGAGGGAGT TTTATTATAT GCAAGAACTT 6 42 0 

GCTCTTTTCG AAATCCACAG CAAACTGTGC ACCTACTACG ATCAACTGCG CATAGTCAAC 6 4 80 

TTCGATCGTT CCGTAGCACC ATGCAGCGAA GATGCTCAGC TGTACGTACG GAAGAACGGC 6 54 0 

TCAACGATAG TGCAGGGTAA AGAGGTACGT TTGCACATTA AGGATTTCCA CGATCACGAT 6 6 00 

TTCCTGTTTG ACGGAAA7VAT TTCTATTAAC AAGCGGCGGC GAGGCGGAAA TGTTTTATAT 6 66 0 

CACGACAACC TCGCGTTCTT GGCGAGTAAT TTGTTCTTAG CCGGCTACCC CTTTTCAAGG 6 72 0 
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AGCTTCGTCT TCACGAATTC GTCGGTCGAT ATTCTCCTCT ACGAAGCTCC ACCCGGAGGT 6 780 

GGTAAGACGA CGACGCTGAT TGACTCGTTC TTGAAGGTCT TCAAGAAAGG TGAGGTTTCC 684 0 

ACCATGATCT TAACCGCCAA CAAAAGTTCG CAGGTTGAGA TCCTAAAGAA AGTGGAGAAG 6 900 

GAAGTGTCTA ACATTGAATG CCAGAAACGT AAAGACAAAA GATCTCCGAA AAAGAGCATT 6 960 

TACACCATCG ACGCTTATTT AATGCATCAC CGTGGTTGTG ATGCAGACGT TCTTTTCATC 7020 

GATGAGTGTT TCATGGTTCA TGCGGGTAGC GTACTAGCTT GCATTGAGTT CACGAGGTGT 7080 

CATAAAGTAA TGATCTTCGG GGATAGCCGG CAGATTCACT ACATTGTUUVG GAACGAATTG 714 0 

GACAAGTGTT TGTATGGGGA TCTCGACAGG TTCGTGGACC TGCAGTGTCG GGTTTATGGT 7200 

AATATTTCGT ACCGTTGTCC ATGGGATGTG TGCGCTTGGT TAAGCACAGT GTATGGCAAC 726 0 

CTAATCGCCA CCGTGAAGGG TGAAAGCGAA GGTAAGAGCA GCATGCGCAT TAACGAAATT 7 32 0 

AATTCAGTCG ACGATTTAGT CCCCGACGTG GGTTCCACGT TTCTGTGTAT GCTTCAGTCG 7380 

GAGAAGTTGG AAATCAGCAA GCACTTTATT CGCAAGGGTT TGACTAAACT TAACGTTCTA 744 0 

ACGGTGCATG AGGCGCAAGG TGAGACGTAT GCGCGTGTGA ACCTTGTGCG ACTTAAGTTT 7 500 

CAGGAGGATG AACCCTTTAA ATCTATCAGG CACATAACCG TCGCTCTTTC TCGTCACACC 7 56 0 

GACAGCTTAA CTTATAACGT CTTAGCTGCT CGTCGAGGTG ACGCCACTTG CGATGCCATC 762 0 

CAGAAGGCTG CGGAATTGGT GAACAAGTTT CGCGTTTTTC CTACATCTTT TGGTGGTAGT 768 0 

GTTATCAATC TCAACGTGAA GAAGGACGTG GAAGATAACA GTAGGTGCAA GGCTTCGTCG 7 74 0 

GCACCATTGA GCGTAATCAA CGACTTTTTG AACGAAGTTA ATCCCGGTAC TGCGGTGATT 7 8 00 

GATTTTGGTG ATTTGTCCGC GGACTTCAGT ACTGGGCCTT TTGAGTGCGG TGCCAGCGGT 7 86 0 

ATTGTGGTGC GGGACAACAT CTCCTCCAGC AACATCACTG ATCACGATAA GCAGCGTGTT 7 920 

TAGCGTAGTT CGGTCGCAGG CGATTCCGCG TAGAAAACCT TCTCTACAAG AAAATTTGTA 7980 

TTCGTTTGAA GCGCGGAATT ATAACTTCTC GACTTGCGAC CGTAACACAT CTGCTTCAAT 804 0 

GTTCGGAGAG GCTATGGCGA TGAACTGTCT TCGTCGTTGC TTCGACCTAG ATGCCTTTTC 8100 

GTCCCTGCGT GATGATGTGA TTAGTATCAC ACGTTCAGGC ATCGAACAAT GGCTGGAGAA 8160 

ACGTACTCCT AGTCAGATTA AAGCATTAAT GAAGGATGTT GAATCGCCTT TGGAAATTGA 822 0 

CGATGAAATT TGTCGTTTTA AGTTGATGGT GAAGCGTGAC GCTAAGGTGA AGTTAGACTC 82 80 

TTCTTGTTTA ACTAAACACA GCGCCGCTCA AAATATCATG TTTCATCGCA AGAGCATTAA 834 0 

TGCTATCTTC TCTCCTATCT TTAATGAGGT GAAAAACCGA ATAATGTGCT GTCTTAAGCC 84 00 

TAACATAAAG TTTTTTACGG AGATGACTAA CAGGGATTTT GCTTCTGTTG TCAGCAACAT 84 60 
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GCTTGGTGAC 


GACGATGTGT 


ACCATATAGG 


TGAAGTTGAT 


TTCTCAAAGT 


ACGACAAGTC 


8520 


TCAAGATGCT 


TTCGTGAAGG 


CTTTTGAAGA 


AGTAATGTAT 


AAGGAACTCG 


GTGTTGATGA 


8580 


AGAGTTGCTG 


GCTATCTGGA 


TGTGCGGCGA 


GCGGTTATCG 


ATAGCTAACA 


CTCTCGATGG 


8640 


TCAGTTGTCC 


TTCACGATCG 


AGAATCAAAG 


GTIAGTCGGGA 


GCTTCGAACA 


CTTGGATTGG 


8700 


TAACTCTCTC 


GTCACTTTGG 


GTATTTTAAG 


TCTTTACTAC 


GACGTTAGAA 


ATTTCGAGGC 


8760 


GTTGTACATC 


TCGGGCGATG 


ATTCTTTAAT 


TTTTTCTCGC 


AGCGAGATTT 


CGAATTATGC 


8820 


CGACGACATA 


TGCACTGACA 


TGGGTTTTGA 


GACAAAATTT 


ATGTCCCCAA 


GTGTCCCGTA 


8880 


CTTTTGTTCT 


AAATTTGTTG 


TTATGTGTGG 


TCATAAGACG 


TTTTTTGTTC 


CCGACCCGTA 


8940 


CAAGCTTTTT 


GTCAAGTTGG 


GAGCAGTCAA 


AGAGGATGTT 


TCT^TGGATT 


TCCTTTTCGA 


9000 


GACTTTTACC 


TCCTTTAAAG 


ACTTAACCTC 


CGATTTTAAC 


GACGAGCGCT 


TAATTCAAAA 


9060 


GCTCGCTGAA 


CTTGTGGCTT 


TAAAATATGA 


GGTTCAAACC 


GGCAACACCA 


CCTTGGCGTT 


9120 


AAGTGTGATA 


CATTGTTTGC 


GTTCGAATTT 


CCTCTCGTTT 


AGCAAGTTAT 


ATCCTCGCGT 


9180 


GAAGGGATGG 


CAGGTTTTTT 


ACACGTCGGT 


TAAGAAAGCG 


CTTCTCAAGA 


GTGGGTGTTC 


9240 


TCTCTTCGAC 


AGTTTCATGA 


CCCCTTTTGG 


TCAGGCTGTC 


ATGGTTTGGG 


ATGATGAGTA 


9300 


GCGCTAACTT 


GTGCGCAGTT 


TCTTTGTTCG 


TGACATACAC 


CTTGTGTGTC 


ACCGTGCGTT 


9360 


TATAATGAAT 


CAGGTTTTGC 


AGTTTGAATG 


TTTGTTTCTG 


CTGAATCTCG 


CGGTTTTTGC 


9420 


TGTGACTTTC 


ATTTTCATTC 


TTCTGGTCTT 


CCGCGTGATT 


AAGTCTTTTC 


GCCAGAAGGG 


9480 


TCACGAAGCA 


CCTGTTCCCG 


TTGTTCGTGG 


CGGGGGTTTT 


TCAACCGTAG 


TGTAGTCAAA 


9540 


AGACGCGCAT 


ATGGTAGTTT 


TCGGTTTGGA 


CTTTGGCACC 


ACATTCTCTA 


CGGTGTGTGT 


9600 


GTACAAGGAT 


GGACGAGTTT 


TTTCATTCAA 


GCAGAATAAT 


TCGGCGTACA 


TCCCCACTTA 


9660 


CCTCTATCTC 


TTCTCCGATT 


CTAACCACAT 


GACTTTTGGT 


TACGAGGCCG 


AATCACTGAT 


9720 


GAGTAATCTG 


AAAGTTAAAG 


GTTCGTTTTA 


TAGAGATTTA 


AAACGTTGGG 


TGGGTTGCGA 


9780 


TTCGAGTAAC 


CTCGACGCGT 


ACCTTGACCG 


TTTAAAACCT 


CATTACTCGG 


TCCGCTTGGT 


9840 


TAAGATCGGC 


TCTGGCTTGA 


ACGAAACTGT 


TTCAATTGGA 


AACTTCGGGG 


GCACTGTTAA 


9900 


GTCTGAGGCT 


CATCTGCCAG 


GGTTGATAGC 


TCTCTTTATT 


AAGGCTGTCA 




996 0 


GGAGGGCGCG 


TTTGCGTGCA 


CTTGCACCGG 


GGTTATTTGT 


TCAGTACCTG 


CCAATTATGA 


10020 


TAGCGTTCAA 


AGGAATTTCA 


CTGATCAGTG 


TGTTTCACTC 


AGCGGTTATC 


AGTGCGTATA 


10080 


TATGATCAAT 


GAACCTTCAG 


CGGCTGCGCT 


ATCTGCGTGT 


AATTCGATTG 


GAAAGAAGTC 


10140 
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CGCAAATTTG GCTGTTTACG ATTTCGGTGG TGGGACCTTC GACGTGTCTA TCATTTCATA 10200 
CCGCAACAAT ACTTTTGTTG TGCGAGCTTC TGGAGGCGAT CTAAATCTCG GTGGAAGGGA 10260 
TGTTGATCGT GCGTTTCTCA CGCACCTCTT CTCTTTAACA TCGCTGGAAC CTGACCTCAC 1032 0 
TTTGGATATC TCGAATCTGA AAGAATCTTT ATCAAAAACG GACGCAGAGA TAGTTTACAC 1038 0 
TTTGAGAGGT GTCGATGGAA GAAAAGAAGA CGTTAGAGTA AACAAAAACA TTCTTACGTC 10440 
GGTGATGCTC CCCTACGTGA ACAGAACGCT TAAGATATTA GAGTCAACCT TAAAATCGTA 10500 
TGCTAAGAGT ATGAATGAGA GTGCGCGAGT TAAGTGCGAT TTAGTGCTGA TAGGAGGATC 10560 
TTCATATCTT CCTGGCCTGG CAGACGTACT AACGAAGCAT CAGAGCGTTG ATCGTATCTT 10620 
AAGAGTTTCG GATCCTCGGG CTGCCGTGGC CGTCGGTTGC GCATTATATT CTTCATGCCT 10680 
CTCAGGATCT GGGGGGTTGC TACTGATCGA CTGTGCAGCT CACACTGTCG CTATAGCGGA 1074 0 
CAGAAGTTGT CATCAAATCA TTTGCGCTCC AGCGGGGGCA CCGATCCCCT TTTCAGGAAG 10800 
CATGCCTTTG TACTTAGCCA GGGTCAACAA GAACTCGCAG CGTGAAGTCG CCGTGTTTGA 10860 
AGGGGAGTAC GTTAAGTGCC CTAAGAACAG AAAGATCTGT GGAGCAAATA TAAGATTTTT 10 920 
TGATATAGGA GTGACGGGTG ATTCGTACGC ACCCGTTACC TTCTATATGG ATTTCTCCAT 1098 0 
TTCAAGCGTA GGAGCCGTTT CATTCGTGGT GAGAGGTCCT GAGGGTAAGC AAGTGTCACT 1104 0 
CACTGGAACT CCAGCGTATA ACTTTTCGTC TGTGGCTCTC GGATCACGCA GTGTCCGAGA 11100 
ATTGCATATT AGTTTAAATA ATAAAGTTTT TCTCGGTTTG CTTCTACATA GAAAGGCGGA 1116 0 
TCGACGAATA CTTTTCACTA AGGATGAAGC GATTCGATAC GCCGATTCAA TTGATATCGC 1122 0 
GGATGTGCTA AAGGAATATA AAAGTTACGC GGCCAGTGCC TTACCACCAG ACGAGGATGT 112 8 0 
CGAATTACTC CTGGGAAAGT CTGTTCAAAA AGTTTTACGG GGAAGCAGAC TGGAAGAAAT 1134 0 
ACCTCTCTAG GAGCATAGCA GCACACTCAA GTGAAATTAA AACTCTACCA GACATTCGAT 114 00 
TGTACGGCGG TAGGGTTGTA AAGAAGTCCG AATTCGAATC AGCACTTCCT AATTCTTTTG 11460 
AACAGGAATT AGGACTGTTC ATACTGAGCG AACGGGAAGT GGGATGGAGC AAATTATGCG 11520 
GAATAACGGT GGAAGAAGCA GCATACGATC TTACGAATCC CAAGGCTTAT AAATTCACTG 11580 
CCGAGACATG TAGCCCGGAT GTAAAAGGTG AAGGACAAAA ATACTCTATG GAAGACGTGA 1164 0 
TGAATTTCAT GCGTTTATCA AATCTGGATG TTAACGACAA GATGCTGACG GAACAGTGTT 117 00 
GGTCGCTGTC CAATTCATGC GGTGAATTGA TCAACCCAGA CGACAAAGGG CGATTCGTGG 1176 0 
CTCTCACCTT TAAGGACAGA GACACAGCTG ATGACACGGG TGCCGCCAAC GTGGAATGTC 11820 
GCGTGGGCGA CTATCTAGTT TACGCTATGT CCGTGTTTGA GCAGAGGACC CAAAAATCGC 11880 
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AGTCTGGCAA CATCTCTCTG TACGAAAAGT ACTGTGAATA CATCAGGACC TACTTAGGGA 1194 0 

GTACAGACCT GTTCTTCACA GCGCCGGACA GGATTCCGTT ACTTACGGGC ATCCTATACG 12 0 00 

ATTTTTGTAA GGAATACAAC GTTTTCTACT CGTCATATAA GAGAT^CGTC GATAATTTCA 12 06 0 

GATTCTTCTT GGCGAATTAT ATGCCTTTGA TATCTGACGT CTTTGTCTTC CAGTGGGTAA 1212 0 

AACCCGCGCC GGATGTTCGG CTGCTTTTTG AGTTAAGTGC AGCGGAACTA ACGCTGGAGG 1218 0 

TTCCCACACT GAGTTTGATA GATTCTCAAG TTGTGGTAGG TCATATCTTA AGATACGTAG 12240 

AATCCTACAC ATCAGATCCA GCCATCGACG CGTTAGAAGA CAAACTGGAA GCGATACTGA 12 3 00 

AAAGTAGCAA TCCCCGTCTA TCGACAGCGC AACTATGGGT TGGTTTCTTT TGTTACTATG 12 36 0 

GTGAGTTTCG TACGGCTCAA AGTAGAGTAG TGCAAAGACC AGGCGTATAC AAAACACCTG 1242 0 

ACTCAGTGGG TGGATTTGAA ATAAACATGA AAGATGTTGA GAAATTCTTC GATAAACTTC 124 80 

AGAGAGAATT GCCTAATGTA TCTTTGCGGC GTCAGTTTAA CGGAGCTAGA GCGCATGAGG 12 54 0 

CTTTCAAAAT ATTTAAAAAC GG7VAATATAA GTTTCAGACC TATATCGCGT TTAAACGTGC 12 6 00 

CTAGAGAGTT CTGGTATCTG AACATAGACT ACTTCAGGCA CGCGAATAGG TCCGGGTTAA 12 66 0 

CCGAAGAAGA AATACTCATC CTAAACAACA TAAGCGTTGA TGTTAGGAAG TTATGCGCTG 12 72 0 

AGAGAGCGTG CAATACCCTA CCTAGCGCGA AGCGCTTTAG TAAAAATCAT AAGAGTAATA 12780 

TACAATCATC ACGCCAAGAG CGGAGGATTA AAGACCCATT GGTAGTCCTG AAAGACACTT 12 840 

TATATGAGTT CCAACACAAG CGTGCCGGTT GGGGGTCTCG AAGCACTCGA GACCTCGGGA 12 900 

GTCGTGCTGA CCACGCGAAA GGAAGCGGTT GATAAGTTTT TTAATGAACT AAAAAACGAA 12 96 0 

AATTACTCAT CAGTTGACAG CAGCCGATTA AGCGATTCGG AAGTAAAAGA AGTGTTAGAG 13 020 

AAAAGTATIAG AAAGTTTCAA AAGCGAACTG GCCTCCACTG ACGAGCACTT CGTCTACCAC 13 080 

ATTATATTTT TCTTAATCCG ATGTGCTT^G ATATCGACAA GTGAAAAGGT GAAGTACGTT 1314 0 

GGTAGTCATA CGTACGTGGT CGACGGAAAA ACGTACACCG TTCTTGACGC TTGGGTATTC 13 2 00 

AACATGATGA AAAGTCTCAC GAAGAAGTAC AAACGAGTGA ATGGTCTGCG TGCGTTCTGT 13 26 0 

TGCGCGTGCG AAGATCTATA TCTAACCGTC GCACCAATAA TGTCAGAACG CTTTAAGACT 13 32 0 

AAAGCCGTAG GGATGAAAGG TTTGCCTGTT GGAAAGGAAT ACTTAGGCGC CGACTTTCTT 13 3 80 

TCGGGAACTA GCAAACTGAT GAGCGATCAC GACAGGGCGG TCTCCATCGT TGCAGCGT^AA 1344 0 

AACGCTGTCG ATCGTAGCGC TTTCACGGGT GGGGAGAGAA AGATAGTTAG TTTGTATGAT 13 5 00 

CTAGGGAGGT ACTAAGCACG GTGTGCTATA GTGCGTGCTA TAATAATAAA CACTAGTGCT 13 56 0 
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TAAGTCGCGC AGAAGAAAAC GCTATGGAGT TGATGTCCGA CAGCAACCTT AGCAACCTGG 13 62 0 

TGATAACCGA CGCCTCTAGT CTAAATGGTG TCGACAAGAA GCTTTTATCT GCTGAAGTTG 13 680 

AAAAAATGTT GGTGCAGAAA GGGGCTCCTA ACGAGGGTAT AG7\AGTGGTG TTCGGTCTAC 13 74 0 

TCCTTTACGC ACTCGCGGCA AGAACCACGT CTCCTAAGGT TCAGCGCGCA GATTCAGACG 13 8 00 

TTATATTTTC AAATAGTTTC GGAGAGAGGA ATGTGGTAGT AACAGAGGGT GACCTTTlAGA 13 860 

AGGTACTCGA CGGGTGTGCG CCTCTCACTA GGTTCACTAA TAAACTTAGA ACGTTCGGTC 13 920 

GTACTTTCAC TGAGGCTTAC GTTGACTTTT GTATCGCGTA TAAGCACAAA TTACCCCAAC 13 98 0 

TCAACGCCGC GGCGGAATTG GGGATTCCAG CTGAAGATTC GTACTTAGCT GCAGATTTTC 14 040 

TGGGTACTTG CCCGAAGCTC TCTGAATTAC AGCAAAGTAG GAAGATGTTC GCGAGTATGT 14100 

ACGCTCTAAA AACTGAAGGT GGAGTGGTAA ATACACCAGT GAGCAATCTG CGTCAGCTAG 1416 0 

GTAGAAGGGA AGTTATGT7UV TGGAAGATTA CGAAGAAAAA TCCGAATCGC TCATACTGCT 14 22 0 

ACGCACGAAT CTGAACACTA TGCTTTTAGT GGTCAAGTCC GATGCTAGTG TAGAGCTGCC 14 28 0 

TAAACTACTA ATTTGCGGTT ACTTACGAGT GTCAGGACGT GGGGAGGTGA CGTGTTGCAA 14 340 

CCGTGAGGAA TTAACAAGAG ATTTTGAGGG CAATCATCAT ACGGTGATCC GTTCTAGAAT 144 0 0 

CATACAATAT GACAGCGAGT CTGCTTTTGA GGAATTCAAC AACTCTGATT GCGTAGTGAA 1446 0 

Qr^^^^^CCrA GAGACTGGTA GTGTCTTTTG GTTTTTCCTT CGAAGTGAAA CCAAAGGTAG 14 52 0 

AGCGGTGCGA CATTTGCGCA CCTTCTTCGA AGCTAACAAT TTCTTCTTTG GATCGCATTG 14 580 

CGGTACCATG GAGTATTGTT TGAAGCAGGT ACTAACTGAA ACTGAATCTA TAATCGATTC 14 64 0 

TTTTTGCGAA GAAAGAAATC GTTAAGATGA GGGTTATAGT GTCTCCTTAT GAAGCTGAAG 14 7 00 

ACATTCTGAA AAGATCGACT GACATGTTAC GAAACATAGA CAGTGGGGTC TTGAGCACTA 14 760 

AAGAATGTAT CAAGGCATTC TCGACGATAA CGCGAGACCT ACATTGTGCG AAGGCTTCCT 14 820 

ACCAGTGGGG TGTTGACACT GGGTTATATC AGCGTT^TTG CGCTGAAAAA CGTTTAATTG 14 8 80 

ACACGGTGGA GTCAAACATA CGGTTGGCTC AACCTCTCGT GCGTGAAAAA GTGGCGGTTC 14 94 0 

^^^^^^^^^^ GGATGAACCA AAAGAGCTAG TAGCATTCAT CACGCGAAAG TACGTGGAAC 15 0 00 

TCACGGGCGT GGGAGTGAGA GAAGCGGTGA AGAGGGAAAT GCGCTCTCTT ACCAAAACAG 1506 0 

TTTTAAATAA AATGTCTTTG GAAATGGCGT TTTACATGTC ACCACGAGCG TGGAAAAACG 15120 

CTGAATGGTT AGAACTAAAA TTTTCACCTG TGAAAATCTT TAGAGATCTG CTATTAGACG 15180 

TGGAAACGCT CAACGAATTG TGCGCCGAAG ATGATGTTCA CGTCGACAAA GTAAATGAGA 1524 0 
ATGGGGACGA AAATCACGAC CTCGAACTCC AAGACGAATG TTAAACATTG GTTAAGTTTA 153 00 
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ACGAAAATGA TTAGTAAATA ATAAATCGAA CGTGGGTGTA TCTACCTGAC GTATCAACTT 15360 
AAGCTGTTAC TGAGTAATTA AACCAACAAG TGTTGGTGTA ATGTGTATGT TGATGTAGAG 15420 
AAAAATCCGT TTGTAGAACG GTGTTTTTCT CTTCTTTATT TTTAAAAAAA AAATAAAAAA 15480 
AAAAAAAAAA AAGCGGCGGC 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
ACATTGCGAG AGAACCCCAT TAGCGTCTCC GGGGTGAACT TGGGAAGGTC TGCCGCCGCT 
CAGGTTATTT ATTTCGGCAG TTTCACGCAG CCCTTCGCGT TGTATCCGCG CCTIAGAGAGC 
GCGATCGTAA AAACGCAACT TCCACCGGTC AGTGTAGTGA AGGTGGAGTG CGTAGCTGCG 
GAGGTAGCTC CCGACAGGGG CGTGGTCGAC AAGAAACCTA CGTCTGTTGG CGTTCCCCCG 
CAGCGCGGTG TGCTTTCTTT TCCGACGGTG GTTCGGAACC GCGGCGACGT GATAATCACA 
GGGGTGGTGC ATGAAGCCCT GAAGAAAATT AAAGACGGGC TCTTACGCTT CCGCGTAGGC 
GGTGACATGC GTTTTTCGAG ATTTTTCTCA TCGAACTACG GCTGCAGATT CGTCGCGAGC 
GTGCGTACGA ACACTACAGT TTGGCTAAAT TGCACGAAAG CGAGTGGTGA GAAATTCTCA 
CTCGCCGCCG CGTGCACGGC GGATTACGTG GCGATGCTGC GTTATGTGTG TGGCGGGAAA 
TTTCCACTCG TCCTCATGAG TAGAGTTATT TACCCGGATG GGCGCTGTTA CTTGGCCCAT 
ATGAGGTATT TGTGCGCCTT TTACTGTCGC CCGTTTAGAG AGTCGGATTA TGCCCTCGGA 
ATGTGGCCTA CGGTGGCGCG TCTCAGGGCA TGCGTTGAGA AGAACTTCGG TGTCGAAGCT 
TGTGGCATAG CTCTTCGTGG CTATTACACC TCTCGCAATG TTTATCACTG TGATTATGAC 
TCTGCTTATG TAAAATATTT TAGAAACCTT TCCGGCCGCA TTGGCGGTGG TTCGTTCGAT 
CCGACATCTT TAACCTCCGT AATAACGGTG AAGATTAGCG GTCTTCCAGG TGGTCTTCCT 
AAAAATATAG CGTTTGGTGC CTTCCTGTGC GATATACGTT ACGTCGAACC GGTAGACTCG 
GGCGGCATTC AATCGAGCGT TAAGACGAAA CGTGAAGATG CGCACCGAAC CGTAGAGGAA 
CGGGCGGCCG GCGGATCCGT CGAGCAACCG CGACAAAAGA GGATAGATGA GAAAGGTTGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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GGCAGAGTTC CTAGTGGAGG TTTTTCGCAT CTCCTGGTCG GCAACCTTAA CGAAGTTAGG 114 0 

AGGAAGGTAG CTGCCGGACT TCTACGCTTT CGCGTTGGCG GTGATATGGA TTTTCATCGC 12 00 

TCGTTCTCCA CCCAAGCGGG CCACCGCTTG CTGGTGTGGC GCCGCTCGAG CCGGAGCGTG 126 0 

TGCCTTGAAC TTTACTCACC ATCTAAAAAC TTTTTGCGTT ACGATGTCTT GCCCTGTTCT 1320 

GGAGACTATG CAGCGATGTT TTCTTTCGCG GCGGGCGGCC GTTTCCCTTT AGTTTTGATG 13 8 0 

ACTAGAATTA GATACCCGAA CGGGTTTTGT TACTTGGCTC ACTGCCGGTA CGCGTGCGCG 144 0 

TTTCTCTTAA GGGGTTTTGA TCCGAAGCGT TTCGACATCG GTGCTTTCCC CACCGCGGCC 15 00 

AAGCTCAGAA ACCGTATGGT TTCGGAGCTT GGTGT^GAA GTTTAGGTTT GAACTTGTAC 156 0 

GGCGCATATA CGTCACGCGG CGTCTTTCAC TGCGATTATG ACGCTAAGTT TATAAAGGAT 16 2 0 

TTGCGTCTTA TGTCAGCAGT TATAGCTGGA AAGGACGGGG TGGAAGAGGT GGTACCTTCT 16 8 0 

GACATAACTC CTGCCATGAA GCAGAAAACG ATCGT^GCCG TGTATGATAG ATTATATGGC 174 0 

GGCACTGACT CGTTGCTGAA ACTGAGCATC GAGAAAGACT TAATCGATTT CAAAAATGAC 18 00 

GTGCAGAGTT TGAAGAAAGA TCGGCCGATT GTCAAAGTGC CCTTTTACAT GTCGGAAGCA 186 0 

ACACAGAATT CGCTGACGCG TTTCTACCCT CAGTTCGAAC TTAAGTTTTC GCACTCCTCG 192 0 

CATTCAGATC ATCCCGCCGC CGCCGCTTCT AGACTGCTGG AAAATGAAAC GTTAGTGCGC 198 0 

TTATGTGGTA ATAGCGTTTC AGATATTGGA GGTTGTCCTC TTTTCCATTT GCATTCCAAG 2 04 0 

ACGCAAAGAC GGGTTCACGT ATGTAGGCCT GTGTTGGATG GCAAGGATGC GCAGCGTCGC 210 0 

GTGGTGCGTG ATTTGCAGTA TTCCAACGTG CGTTTGGGAG ACGATGATAA AATTTTGGAA 216 0 

GGGCCACGCA ATATCGACAT TTGCCACTAT CCTCTGGGCG CGTGTGACCA CGAAAGTAGT 2 220 

GCTATGATGA TGGTGCAGGT GTATGACGCG TCCCTTTATG AGATATGTGG CGCCATGATC 2280 

AAGAAGAAAA GCCGCATAAC GTACTTAACC ATGGTCACGC CCGGCGAGTT TCTTGACGGA 2 34 0 

CGCGAATGCG TCTACATGGA GTCGTTAGAC TGTGAGATTG AAGTTGATGT GCACGCGGAC 2 4 00 

GTCGTAATGT ACAAATTCGG TAGTTCTTGC TATTCGCACA AGCTTTCAAT CATCAAGGAC 24 6 0 

ATCATGACCA CTCCGTACTT GACACTAGGT GGTTTTCTAT TCAGCGTGGA GATGTATGAG 2 52 0 

GTGCGTATGG GCGTGAATTA CTTCAAGATT ACGAAGTCCG AAGTATCGCC TAGCATTAGC 2 58 0 

TGCACCAAGC TCCTGAGATA CCGAAGAGCT AATAGTGACG TGGTTAAAGT TAAACTTCCA 26 4 0 

CGTTTCGATA AGAAACGTCG CATGTGTCTG CCTGGGTATG ACACCATATA CCTAGATTCG 2 700 

AAGTTTGTGA GTCGCGTTTT CGATTATGTC GTGTGTAATT GCTCTGCCGT GAACTCAAAA 2 76 0 
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ACTTTCGAGT 


GGGTGTGGAG 


TTTCATTAAG 


TCTAGTAAGT 


CGAGGGTGAT 


TATTAGCGGT 


2820 


AAAATAATTC 


ACAAGGATGT 


GAATTTGGAC 


CTCAAGTACG 


TCGAGAGTTT 


CGCCGCGGTT 


2880 


ATGTTGGCCT 


CTGGCGTGCG 


CAGTAGACTA 


GCGTCCGAGT 


ACCTTGCTAA 


GAACCTTAGT 


2940 


CATTTTTCGG 


GAGATTGCTC 


CTTTATTGAA 


GCCACGTCTT 


TCGTGTTGCG 


TGAGAAAATC 


3000 


AGAAACATGA 


CTCTGAATTT 


TAACGAAAGA 


CTTTTACAGT 


TAGTGAAGCG 


CGTTGCCTTT 


3060 


GCGACCTTGG 


ACGTGAGTTT 


TCTAGATTTA 


GATTCAACTC 


TTGAATCAAT 


AACTGATTTT 


3120 


GCCGAGTGTA 


AGGTAGCGAT 


TGAACTCGAC 


GAGTTGGGTT 


GCTTGAGAGC 


GGAGGCCGAG 


3180 


AATGAAAAAA 


TCAGGAATCT 


GGCGGGAGAT 


TCGATTGCGG 


CTAAACTCGC 


GAG CG AG ATA 


3240 


GTGGTCGATA 


TTGACTCTAA 


GCCTTCACCG 


AAGCAGGTGG 


GTAATTCGTC 


ATCCGAAAAC 


3300 


GCCGATAAGC 


GGGAAGTTCA 


GAGGCCCGGT 


TTGCGTGGTG 


GTTCTAGAAA 


CGGGGTTGTT 


3360 


GGGGAGTTCC 


TTCACTTCGT 


CGTGGATTCT 


GCCTTGCGTC 


TTTTCAAATA 


CGCGACGGAT 


3420 


CAACAACGGA 


TCAAGTCTTA 


CGTGCGTTTC 


TTGGACTCGG 


CGGTCTCATT 


CTTGGATTAC 


3480 


AACTACGATA 


ATCTATCGTT 


TATACTGCGA 


GTGCTTTCGG 


AAGGTTATTC 


GTGTATGTTC 


3540 


GCGTTTTTGG 


CGAATCGCGG 


CGACTTATCT 


AGTCGTGTCC 


GTAGCGCGGT 


GTGTGCTGTG 


3600 


AAAGAAGTTG 


CTACCTCATG 


CGCGAACGCG 


AGCGTTTCTA 


AAGCCAAGGT 


TATGATTACC 


3660 


TTCGCAGCGG 


CCGTGTGTGC 


TATGATGTTT 


AATAGCTGCG 


GTTTTTCAGG 


CGACGGTCGG 


3720 


GAGTATAAAT 


CGTATATACA 


TCGTTACACG 


CAAGTATTGT 


TTGACACTAT 


CTTTTTTGAG 


3780 


GACAGCAGTT 


ACCTACCCAT 


AGAAGTTCTG 


AGTTCGGCGA 


TATGCGGTGC 


TATCGTCACA 


3840 


CTTTTCTCCT 


CGGGCTCGTC 


CATAAGTTTA 


AACGCCTTCT 


TACTTCAAAT 


TACCAAAGGA 


3900 


TTCTCCCTAG 


AGGTTGTCGT 


CCGGAATGTT 


GTGCGAGTCA 


CGCATGGTTT 


GAGCACCACA 


3960 


GCGACCGACG 


GCGTCATACG 


TGGGGTTTTC 


TCCCAAATTG 


TGTCTCACTT 


ACTTGTTGGA 


4020 


AATACGGGTA 


ATGTGGCTTA 


CCAGTCAGCT 


TTCATTGCCG 


GGGTGGTGCC 


TCTTTTAGTT 


4080 


AAAAAGTGTG 


TGAGCTTAAT 


CTTCATCTTG 


CGTGAAGATA 


CTTATTCCGG 


TTTTATTAAG 


4140 


CACGGAATCA 


GTGAATTCTC 


TTTCCTTAGT 


AGTATTCTGA 


AGTTCTTGAA 


GGGTAAGCTT 


4200 


GTGGACGAGT 


TGAAATCGAT 


TATTCAAGGG 


GTTTTTGATT 


CCAACAAGCA 


CGTGTTTAAA 


4260 


GAAGCTACTC 


AGGAAGCGAT 


TCGTACGACG 


GTCATGCAAG 


TGCCTGTCGC 


TGTAGTGGAT 


4320 


GCCCTTAAGA 


GCGCCGCGGG 


AAAAATTTAT 


AACAATTTTA 


CTAGTCGACG 


TACCTTTGGT 


4380 


AAGGATGAAG 


GCTCCTCTAG 


CGACGGCGCA 


TGTGAAGAGT 


ATTTCTCATG 


CGACGAAGGT 


4440 


GAAGGTCCGG 


GTCTGAAAGG 


GGGTTCCAGC 


TATGGCTTCT 


CAATTTTAGC 


GTTCTTTTCA 


4500 
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CGCATTATGT GGGGAGCTCG TCGGCTTATT GTTAAGGTGA AGCATGAGTG TTTTGGGAAA 4 56 0 

^^^rp^rpQ^^rp TTCTATCGCT CAAGCTTCAC GAATTCAGGA CTCGCGTTTT TGGGAAGAAT 4 620 

AGAACGGACG TGGGAGTTTA CGATTTTTTG CCCACGGGCA TCGTGGAAAC GCTCTCATCG 4 680 

ATAGAAGAGT GCGACCA7VAT TGAAGAACTT CTCGGCGACG ACCTGAAAGG TGACAAGGAT 4 74 0 

GCTTCGTTGA CCGATATGAA TTACTTTGAG TTCTCAGAAG ACTTCTTAGC CTCTATCGAG 4 800 

GAGCCGCCTT TCGCTGGATT GCGAGGAGGT AGCAAGAACA TCGCGATTTT GGCGATTTTG 4 86 0 

GAATACGCGC ATAATTTGTT TCGCATTGTC GCAAGCAAGT GTTCGAAACG ACCTTTATTT 4 920 

CTTGCTTTCG CCGAACTCTC AAGCGCCCTT ATCGAGAAAT TTAAGGAGGT TTTCCCTCGT 4 980 

AAGAGCCAGC TCGTCGCTAT CGTGCGCGAG TATACTCAGA GATTCCTCCG AAGTCGCATG 5 04 0 

CGTGCGTTGG GTTTGAATAA CGAGTTCGTG GTAAAATCTT TCGCCGATTT GCTACCCGCA 510 0 

TTAATGAAGC GGAAGGTTTC AGGTTCGTTC TTAGCTAGTG TTTATCGCCC ACTTAGAGGT 516 0 

TTCTCATATA TGTGTGTTTC AGCGGAGCGA CGTGAAAAGT TTTTTGCTCT CGTGTGTTTA 522 0 

ATCGGGTTAA GTCTCCCTTT CTTCGTGCGC ATCGTAGGAG CGAAAGCGTG CGAAGAACTC 52 80 

GTGTCCTCAG CGCGTCGCTT TTATGAGCGT ATTAAAATTT TTCTAAGGCA GAAGTATGTC 534 0 

TCTCTTTCTA ATTTCTTTTG TCACTTGTTT AGCTCTGACG TTGATGACAG TTCCGCATCT 5 4 00 

GCAGGGTTGA AAGGTGGTGC GTCGCGAATG ACGCTCTTCC ACCTTCTGGT TCGCCTTGCT 54 6 0 

AGTGCCCTCC TATCGTTAGG GTGGGAAGGG TTAAAGCTAC TCTTATCGCA CCACAACTTG 5 520 

TTATTTTTGT GTTTTGCATT GGTTGACGAT GTGAACGTCC TTATCAAAGT TCTTGGGGGT 5580 

CTTTCTTTCT TTGTGCAACC AATCTTTTCC TTGTTTGCGG CGATGCTTCT ACAACCGGAC 564 0 

AGGTTTGTGG AGTATTCCGA GAAACTTGTT ACAGCGTTTG AATTTTTCTT AAAATGTTCG 57 00 

CCTCGCGCGC CTGCACTACT CAAAGGGTTT TTTGAGTGCG TGGCGAACAG CACTGTGTCA 5 76 0 

AAAACCGTTC GAAGACTTCT TCGCTGTTTC GTGAAGATGC TCAAACTTCG AAAAGGGCGA 5 82 0 

GGGTTGCGTG CGGATGGTAG GGGTCTCCAT CGGCAGAAAG CCGTACCCGT CATACCTTCT 5 880 

AATCGGGTCG TGACCGACGG GGTTGAAAGA CTTTCGGTAA AGATGCAAGG AGTTGAAGCG 5 94 0 

TTGCGTACCG AATTGAGAAT CTTAGAAGAT TTAGATTCTG CCGTGATCGA AAAACTCAAT 6 000 

AGACGCAGAA ATCGTGACAC TAATGACGAC GAATTTACGC GCCCTGCTCA TGAGCAGATG 6 06 0 

CAAGAAGTCA CCACTTTCTG TTCGAAAGCC AACTCTGCTG GTTTGGCCCT GGAAAGGGCA 6120 

GTGCTTGTGG AAGACGCTAT AAAGTCGGAG AAACTTTCTA AGACGGTTAA TGAGATGGTG 618 0 
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AGGAAAGGGA GTACCACCAG CGAAGAAGTG GCCGTCGCTT TGTCGGACGA TGAAGCCGTG 6 24 0 

GAAGAAATCT CTGTTGCTGA CGAGCGAGAC GATTCGCCTA AGACAGTCAG GATAAGCGAA 6 3 00 

TACCTAAATA GGTTAAACTC AAGCTTCGAA TTCCCGAAGC CTATTGTTGT GGACGACAAC 6 3 60 

AAGGATACCG GGGGTCTAAC GAACGCCGTG AGGGAGTTTT ATTATATGCA AGAACTTGCT 
CTTTTCGAAA TCCACAGCAA ACTGTGCACC TACTACGATC AACTGCGCAT AGTCAACTTC 
GATCGTTCCG TAGCACCATG CAGCGT^GAT GCTCAGCTGT ACGTACGGAA GAACGGCTCA 
ACGATAGTGC AGGGTAAAGA GGTACGTTTG CACATTAAGG ATTTCCACGA TCACGATTTC 
CTGTTTGACG GAAAAATTTC TATTAACAAG CGGCGGCGAG GCGGAAATGT TTTATATCAC 
GACAACCTCG CGTTCTTGGC GAGTAATTTG TTCTTAGCCG GCTACCCCTT TTCAAGGAGC 
TTCGTCTTCA CGAATTCGTC GGTCGATATT CTCCTCTACG AAGCTCCACC CGGAGGTGGT 
AAGACGACGA CGCTGATTGA CTCGTTCTTG AAGGTCTTCA AGAAAGGTGA GGTTTCCACC 
ATGATCTTAA CCGCCAACAA AAGTTCGCAG GTTGAGATCC TAAAGAAAGT GGAGAAGGAA 
GTGTCTAACA TTGAATGCCA GAAACGTAAA GACAAAAGAT CTCCGAAAAA GAGCATTTAC 
ACCATCGACG CTTATTTAAT GCATCACCGT GGTTGTGATG CAGACGTTCT TTTCATCGAT 
GAGTGTTTCA TGGTTCATGC GGGTAGCGTA CTAGCTTGCA TTGAGTTCAC GAGGTGTCAT 
AAAGTAATGA TCTTCGGGGA TAGCCGGCAG ATTCACTACA TTGAAAGGAA CGAATTGGAC 
AAGTGTTTGT ATGGGGATCT CGACAGGTTC GTGGACCTGC AGTGTCGGGT TTATGGTAAT 
ATTTCGTACC GTTGTCCATG GGATGTGTGC GCTTGGTTAA GCACAGTGTA TGGCAACCTA 
ATCGCCACCG TGAAGGGTGA AAGCGAAGGT AAGAGCAGCA TGCGCATTAA CGAAATTAAT 
TCAGTCGACG ATTTAGTCCC CGACGTGGGT TCCACGTTTC TGTGTATGCT TCAGTCGGAG 
AAGTTGGAAA TCAGCAAGCA CTTTATTCGC AAGGGTTTGA CTAAACTTAA CGTTCTAACG 
GTGCATGAGG CGCAAGGTGA GACGTATGCG CGTGTGAACC TTGTGCGACT TAAGTTTCAG 
GAGGATGAAC CCTTTAAATC TATCAGGCAC ATAACCGTCG CTCTTTCTCG TCACACCGAC 
AGCTTAACTT ATAACGTCTT AGCTGCTCGT CGAGGTGACG CCACTTGCGA TGCCATCCAG 
AAGGCTGCGG AATTGGTGAA CAAGTTTCGC GTTTTTCCTA CATCTTTTGG TGGTAGTGTT 
ATCAATCTCA ACGTGAAGAA GGACGTGGAA GATAACAGTA GGTGCAAGGC TTCGTCGGCA 7 74 0 

CCATTGAGCG TAATCAACGA CTTTTTGAAC GT^GTTAATC CCGGTACTGC GGTGATTGAT 7 8 00 

TTTGGTGATT TGTCCGCGGA CTTCAGTACT GGGCCTTTTG AGTGCGGTGC CAGCGGTATT 7 86 0 

GTGGTGCGGG ACAACATCTC CTCCAGCAAC ATCACTGATC ACGATAAGCA GCGTGTTTAG 7 92 0 



6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
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(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2639 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : 

(D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Thr Leu Arg Glu Asn Pro He Ser Val Ser Gly Val Asn Leu Gly Arg 
1 5 10 15 

ser Ala Ala Ala Gin Val He Tyr Phe Gly Ser Phe Thr Gin Pro Phe 
20 25 30 

Ala Leu Tyr Pro Arg Gin Glu Ser Ala He Val Lys Thr Gin Leu Pro 
35 40 45 

Pro Val Ser Val Val Lys Val Glu Cys Val Ala Ala Glu Val Ala Pro 
50 55 60 

Asp Arg Gly Val Val Asp Lys Lys Pro Thr Ser Val Gly Val Pro Pro 
65 70 75 80 

Gin Arg Gly Val Leu Ser Phe Pro Thr Val Val Arg Asn Arg Gly Asp 
85 90 95 

Val He He Thr Gly Val Val His Glu Ala Leu Lys Lys He Lys Asp 
100 105 110 

Gly Leu Leu Arg Phe Arg Val Gly Gly Asp Met Arg Phe Ser Arg Phe 
115 120 125 

Phe Ser Ser Asn Tyr Gly Cys Arg Phe Val Ala Ser Val Arg Thr Asn 
130 135 140 

Thr Thr Val Trp Leu Asn Cys Thr Lys Ala Ser Gly Glu Lys Phe Ser 
145 150 155 160 

Leu Ala Ala Ala Cys Thr Ala Asp Tyr Val Ala Met Leu Arg Tyr Val 
165 170 175 

Cys Gly Gly Lys Phe Pro Leu Val Leu Met Ser Arg Val He Tyr Pro 
180 185 190 

Asp Gly Arg Cys Tyr Leu Ala His Met Arg Tyr Leu Cys Ala Phe Tyr 
195 200 205 

Cys Arg Pro Phe Arg Glu Ser Asp Tyr Ala Leu Gly Met Trp Pro Thr 
210 215 220 
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Val Ala Arg Leu Arg Ala Cys Val Glu Lys Asn Phe Gly Val Glu Ala 
225 230 235 240 

Cys Gly He Ala Leu Arg Gly Tyr Tyr Thr Ser Arg Asn Val Tyr His 
245 250 255 

Cys Asp Tyr Asp Ser Ala Tyr Val Lys Tyr Phe Arg Asn Leu Ser Gly 
260 265 270 

Arg He Gly .Gly Gly Ser Phe Asp Pro Thr Ser Leu Thr Ser Val He 
275 280 285 

Thr Val Lys He Ser Gly Leu Pro Gly Gly Leu Pro Lys Asn He Ala 
290 295 300 

Phe Gly Ala Phe Leu Cys Asp He Arg Tyr Val Glu Pro Val Asp Ser 
305 310 315 320 

Gly Gly He Gin Ser Ser Val Lys Thr Lys Arg Glu Asp Ala His Arg 
325 330 335 

Thr Val Glu Glu Arg Ala Ala Gly Gly Ser Val Glu Gin Pro Arg Gin 
340 345 350 

Lys Arg He Asp Glu Lys Gly Cys Gly Arg Val Pro Ser Gly Gly Phe 
355 360 365 

Ser His Leu Leu Val Gly Asn Leu Asn Glu Val Arg Arg Lys Val Ala 
370 375 380 

Ala Gly Leu Leu Arg Phe Arg Val Gly Gly Asp Met Asp Phe His Arg 
385 390 395 4OO 

Ser Phe Ser Thr Gin Ala Gly His Arg Leu Leu Val Trp Arg Arg Ser 
405 410 415 

Ser Arg Ser Val Cys Leu Glu Leu Tyr Ser Pro Ser Lys Asn Phe Leu 
420 425 430 

Arg Tyr Asp Val Leu Pro Cys Ser Gly Asp Tyr Ala Ala Met Phe Ser 
435 440 445 

Phe Ala Ala Gly Gly Arg Phe Pro Leu Val Leu Met Thr Arg He Arg 
450 455 4,60 

Tyr Pro Asn Gly Phe Cys Tyr Leu Ala His Cys Arg Tyr Ala Cys Ala 
465 470 475 48O 

Phe Leu Leu Arg Gly Phe Asp Pro Lys Arg Phe Asp He Gly Ala Phe 
485 490 495 

Pro Thr Ala Ala Lys Leu Arg Asn Arg Met Val Ser Glu Leu Gly Glu 
500 505 510 

Arg Ser Leu Gly Leu Asn Leu Tyr Gly Ala Tyr Thr Ser Arg Gly Val 
515 520 525 
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Phe His Cys Asp Tyr Asp Ala Lys Phe He Lys Asp Leu Arg Leu Met 
530 535 540 

Ser Ala Val He Ala Gly Lys Asp Gly Val Glu Glu Val Val Pro Ser 
545 550 555 560 

Asp He Thr Pro Ala Met Lys Gin Lys Thr He Glu Ala Val Tyr Asp 
565 570 575 

Arg Leu Tyr Gly Gly Thr Asp Ser Leu Leu Lys Leu Ser He Glu Lys 
580 585 590 

Asp Leu He Asp Phe Lys Asn Asp Val Gin Ser Leu Lys Lys Asp Arg 
595 600 605 

Pro He Val Lys Val Pro Phe Tyr Met Ser Glu Ala Thr Gin Asn Ser 
610 615 620 

Leu Thr Arg Phe Tyr Pro Gin Phe Glu Leu Lys Phe Ser His Ser Ser 
625 630 635 640 

His Ser Asp His Pro Ala Ala Ala Ala Ser Arg Leu Leu Glu Asn Glu 
645 650 655 

Thr Leu Val Arg Leu Cys Gly Asn Ser Val Ser Asp He Gly Gly Cys 
660 665 670 

Pro Leu Phe His Leu His Ser Lys Thr Gin Arg Arg Val His Val Cys 
675 680 685 

Arg Pro Val Leu Asp Gly Lys Asp Ala Gin Arg Arg Val Val Arg Asp 
690 695 700 

Leu Gin Tyr Ser Asn Val Arg Leu Gly Asp Asp Asp Lys He Leu Glu 
705 710 715 720 

Gly Pro Arg Asn He Asp He Cys His Tyr Pro Leu Gly Ala Cys Asp 
725 730 735 

His Glu Ser Ser Ala Met Met Met Val Gin Val Tyr Asp Ala Ser Leu 
740 745 750 

Tyr Glu He Cys Gly Ala Met He Lys Lys Lys Ser Arg He Thr Tyr 
755 760 765 

Leu Thr Met Val Thr Pro Gly Glu Phe Leu Asp Gly Arg Glu Cys Val 
770 775 780 

Tyr Met Glu Ser Leu Asp Cys Glu He Glu Val Asp Val His Ala Asp 
785 790 795 800 

Val Val Met Tyr Lys Phe Gly Ser Ser Cys Tyr Ser His Lys Leu Ser 
805 810 815 

He He Lys Asp He Met Thr Thr Pro Tyr Leu Thr Leu Gly Gly Phe 
820 825 830 
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Leu Phe Ser Val Glu Met Tyr Glu Val Arg Met Gly Val Asn Tyr Phe 
835 840 845 

Lys He Thr Lys Ser Glu Val Ser Pro Ser He Ser Cys Thr Lys Leu 
850 855 860 

Leu Arg Tyr Arg Arg Ala Asn Ser Asp Val Val Lys Val Lys Leu Pro 
865 870 875 880 

Arg Phe Asp Lys Lys Arg Arg Met Cys Leu Pro Gly Tyr Asp Thr He 
885 890 895 

Tyr Leu Asp Ser Lys Phe Val Ser Arg Val Phe Asp Tyr Val Val Cys 
900 905 910 

Asn Cys Ser Ala Val Asn Ser Lys Thr Phe Glu Trp Val Trp Ser Phe 
915 920 925 

He Lys Ser Ser Lys Ser Arg Val He He Ser Gly Lys He He His 
930 935 940 

Lys Asp Val Asn Leu Asp Leu Lys Tyr Val Glu Ser Phe Ala Ala Val 
945 950 955 960 

Met Leu Ala Ser Gly Val Arg Ser Arg Leu Ala Ser Glu Tyr Leu Ala 
965 970 975 

Lys Asn Leu Ser His Phe Ser Gly Asp Cys Ser Phe He Glu Ala Thr 
980 985 990 

Ser Phe Val Leu Arg Glu Lys He Arg Asn Met Thr Leu Asn Phe Asn 
995 1000 1005 

Glu Arg Leu Leu Gin Leu Val Lys Arg Val Ala Phe Ala Thr Leu Asp 
1010 1015 1020 

Val Ser Phe Leu Asp Leu Asp Ser Thr Leu Glu Ser He Thr Asp Phe 
1025 1030 1035 io40 

Ala Glu Cys Lys Val Ala He Glu Leu Asp Glu Leu Gly Cys Leu Arg 
1045 1050 1055 

Ala Glu Ala Glu Asn Glu Lys He Arg Asn Leu Ala Gly Asp Ser He 
1060 1065 1070 

Ala Ala Lys Leu Ala Ser Glu He Val Val Asp He Asp Ser Lys Pro 
1075 1080 1085 

Ser Pro Lys Gin Val Gly Asn Ser Ser Ser Glu Asn Ala Asp Lys Arg 
1090 1095 1100 

Glu Val Gin Arg Pro Gly Leu Arg Gly Gly Ser Arg Asn Gly Val Val 
1105 1110 1115 1120 

Gly Glu Phe Leu His Phe Val Val Asp Ser Ala Leu Arg Leu Phe Lys 
H25 1130 1135 
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Tyr Ala Thr Asp Gin Gin Arg lie Lys Ser Tyr Val Arg Phe Leu Asp 
1140 1145 1150 

Ser Ala Val Ser Phe Leu Asp Tyr Asn Tyr Asp Asn Leu Ser Phe lie 
1155 1160 1165 

Leu Arg Val Leu Ser Glu Gly Tyr Ser Cys Met Phe Ala Phe Leu Ala 
1170 1175 1180 

Asn Arg Gly Asp Leu Ser Ser Arg Val Arg Ser Ala Val Cys Ala Val 
1185 1190 1195 1200 

Lys Glu Val Ala Thr Ser Cys Ala Asn Ala Ser Val Ser Lys Ala Lys 
1205 1210 1215 

Val Met lie Thr Phe Ala Ala Ala Val Cys Ala Met Met Phe Asn Ser 
1220 1225 1230 

Cys Gly Phe Ser Gly Asp Gly Arg Glu Tyr Lys Ser Tyr lie His Arg 
1235 1240 1245 

Tyr Thr Gin Val Leu Phe Asp Thr He Phe Phe Glu Asp Ser Ser Tyr 
1250 1255 1260 

Leu Pro He Glu Val Leu Ser Ser Ala He Cys Gly Ala He Val Thr 
1265 1270 1275 1280 

Leu Phe Ser Ser Gly Ser Ser He Ser Leu Asn Ala Phe Leu Leu Gin 
1285 1290 1295 

He Thr Lys Gly Phe Ser Leu Glu Val Val Val Arg Asn Val Val Arg 
1300 1305 1310 

Val Thr His Gly Leu Ser Thr Thr Ala Thr Asp Gly Val He Arg Gly 
1315 1320 1325 

Val Phe Ser Gin He Val Ser His Leu Leu Val Gly Asn Thr Gly Asn 
1330 1335 1340 

Val Ala Tyr Gin Ser Ala Phe He Ala Gly Val Val Pro Leu Leu Val 
1345 1350 1355 1360 

Lys Lys Cys Val Ser Leu He Phe He Leu Arg Glu Asp Thr Tyr Ser 
1365 1370 1375 

Gly Phe He Lys His Gly He Ser Glu Phe Ser Phe Leu Ser Ser He 
1380 1385 1390 

Leu Lys Phe Leu Lys Gly Lys Leu Val Asp Glu Leu Lys Ser He He 
1395 1400 1405 

Gin Gly Val Phe Asp Ser Asn Lys His Val Phe Lys Glu Ala Thr Gin 
1410 1415 1420 



Glu Ala He Arg Thr Thr Val Met Gin Val Pro Val Ala Val Val Asp 
1425 1430 1435 1440 
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Ala Leu Lys Ser Ala Ala Gly Lys lie Tyr Asn Asn Phe Thr Ser Arg 
1445 1450 1455 

Arg Thr Phe Gly Lys Asp Glu Gly Ser Ser Ser Asp Gly Ala Cys Glu 
1460 1465 1470 

Glu Tyr Phe Ser Cys Asp Glu Gly Glu Gly Pro Gly Leu Lys Gly Gly 
1475 1480 1485 

Ser Ser Tyr Gly Phe Ser lie Leu Ala Phe Phe Ser Arg lie Met Trp 
1490 1495 1500 

Gly Ala Arg Arg Leu lie Val Lys Val Lys His Glu Cys Phe Gly Lys 
1505 1510 1515 1520 

Leu Phe Glu Phe Leu Ser Leu Lys Leu His Glu Phe Arg Thr Arg Val 
1525 1530 1535 

Phe Gly Lys Asn Arg Thr Asp Val Gly Val Tyr Asp Phe Leu Pro Thr 
1540 1545 1550 

Gly lie Val Glu Thr Leu Ser Ser lie Glu Glu Cys Asp Gin lie Glu 
1555 1560 1565 

Glu Leu Leu Gly Asp Asp Leu Lys Gly Asp Lys Asp Ala Ser Leu Thr 
1570 1575 1580 

Asp Met Asn Tyr Phe Glu Phe Ser Glu Asp Phe Leu Ala Ser He Glu 
1585 1590 1595 1600 

Glu Pro Pro Phe Ala Gly Leu Arg Gly Gly Ser Lys Asn He Ala He 
1605 1610 1615 

Leu Ala He Leu Glu Tyr Ala His Asn Leu Phe Arg He Val Ala Ser 
1620 1625 1630 

Lys Cys Ser Lys Arg Pro Leu Phe Leu Ala Phe Ala Glu Leu Ser Ser 
1635 1640 1645 

Ala Leu He Glu Lys Phe Lys Glu Val Phe Pro Arg Lys Ser Gin Leu 
1650 1655 1660 

Val Ala He Val Arg Glu Tyr Thr Gin Arg Phe Leu Arg Ser Arg Met 
1665 1670 1675 1680 

Arg Ala Leu Gly Leu Asn Asn Glu Phe Val Val Lys Ser Phe Ala Asp 
1685 1690 1695 

Leu Leu Pro Ala Leu Met Lys Arg Lys Val Ser Gly Ser Phe Leu Ala 
1700 1705 1710 

Ser Val Tyr Arg Pro Leu Arg Gly Phe Ser Tyr Met Cys Val Ser Ala 
1715 1720 1725 

Glu Arg Arg Glu Lys Phe Phe Ala Leu Val Cys Leu He Gly Leu Ser 
1730 1735 1740 
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Leu Pro Phe Phe Val Arg lie Val Gly Ala Lys Ala Cys Glu Glu Leu 
1745 1750 1755 1760 

Val Ser Ser Ala Arg Arg Phe Tyr Glu Arg He Lys He Phe Leu Arg 
1765 1770 1775 

Gin Lys Tyr Val. Ser Leu Ser Asn Phe Phe Cys His Leu Phe Ser Ser 
1780 1785 1790 

Asp Val Asp Asp Ser Ser Ala Ser Ala Gly Leu Lys Gly Gly Ala Ser 
1795 1800 1805 

Arg Met Thr Leu Phe His Leu Leu Val Arg Leu Ala Ser Ala Leu Leu 
1810 1815 1820 

Ser Leu Gly Trp Glu Gly Leu Lys Leu Leu Leu Ser His His Asn Leu 
1825 1830 1835 1840 

Leu Phe Leu Cys Phe Ala Leu Val Asp Asp Val Asn Val Leu He Lys 
1845 1850 1855 

Val Leu Gly Gly Leu Ser Phe Phe Val Gin Pro He Phe Ser Leu Phe 
1860 1865 1870 

Ala Ala Met Leu Leu Gin Pro Asp Arg Phe Val Glu Tyr Ser Glu Lys 
1875 1880 1885 

Leu Val Thr Ala Phe Glu Phe Phe Leu Lys Cys Ser Pro Arg Ala Pro 
1890 1895 1900 

Ala Leu Leu Lys Gly Phe Phe Glu Cys Val Ala Asn Ser Thr Val Ser 
1905 1910 1915 1920 



Lys Thr Val Arg Arg Leu Leu Arg Cys Phe Val Lys Met Leu Lys. Leu 
1925 1930 1935 

Arg Lys Gly Arg Gly Leu Arg Ala Asp Gly Arg Gly Leu His Arg Gin 
1940 1945 1950 

Lys Ala Val Pro Val He Pro Ser Asn Arg Val Val Thr Asp Gly Val 
1955 1960 1965 



Glu Arg Leu Ser Val Lys Met Gin 
1970 1975 

Leu Arg He Leu Glu Asp Leu Asp 
1985 1990 

Arg Arg Arg Asn Arg Asp Thr Asn 
2005 



Gly Val Glu Ala Leu Arg Thr Glu 
1980 

Ser Ala Val He Glu Lys Leu Asn 
1995 2000 

Asp Asp Glu Phe Thr Arg Pro Ala 
2010 2015 



His Glu Gin Met Gin Glu Val Thr Thr Phe Cys Ser Lys Ala Asn Ser 
2020 2025 2030 



Ala Gly Leu Ala Leu Glu Arg Ala Val Leu Val Glu Asp Ala He Lys 
2035 2040 2045 
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Ser Glu Lys Leu Ser Lys Thr Val Asn Glu Met Val Arg Lys Gly Ser 
2050 2055 2060 

Thr Thr Ser Glu Glu Val Ala Val Ala Leu Ser Asp Asp Glu Ala Val 
2065 2070 2075 2080 

Glu Glu lie Ser Val Ala Asp Glu Arg Asp Asp Ser Pro Lys Thr Val 
2085 2090 2095 

Arg lie Ser Glu Tyr Leu Asn Arg Leu Asn Ser Ser Phe Glu Phe Pro 
2100 2105 2110 

Lys Pro lie Val Val Asp Asp Asn Lys Asp Thr Gly Gly Leu Thr Asn 
2115 2120 2125 

Ala Val Arg Glu Phe Tyr Tyr Met Gin Glu Leu Ala Leu Phe Glu lie 
2130 2135 2140 

His Ser Lys Leu Cys Thr Tyr Tyr Asp Gin Leu Arg lie Val Asn Phe 
2145 2150 2155 2160 

Asp Arg Ser Val Ala Pro Cys Ser Glu Asp Ala Gin Leu Tyr Val Arg 
2165 2170 2175 

Lys Asn Gly Ser Thr lie Val Gin Gly Lys Glu Val Arg Leu His He 
2180 2185 2190 

Lys Asp Phe His Asp His Asp Phe Leu Phe Asp Gly Lys He Ser He 
2195 2200 2205 

Asn Lys Arg Arg Arg Gly Gly Asn Val Leu Tyr His Asp Asn Leu Ala 
2210 2215 2220 

Phe Leu Ala Ser Asn Leu Phe Leu Ala Gly Tyr Pro Phe Ser Arg Ser 
2225 2230 2235 2240 

Phe Val Phe Thr Asn Ser Ser Val Asp He Leu Leu Tyr Glu Ala Pro 
2245 2250 2255 

Pro Gly Gly Gly Lys Thr Thr Thr Leu He Asp Ser Phe Leu Lys Val 
2260 2265 2270 

Phe Lys Lys Gly Glu Val Ser Thr Met He Leu Thr Ala Asn Lys Ser 
2275 2280 2285 

Ser Gin Val Glu He Leu Lys Lys Val Glu Lys Glu Val Ser Asn He 
2290 2295 2300 

Glu Cys Gin Lys Arg Lys Asp Lys Arg Ser Pro Lys Lys Ser He Tyr 
2305 2310 2315 2320 

Thr He Asp Ala Tyr Leu Met His His Arg Gly Cys Asp Ala Asp Val 
2325 2330 2335 



Leu Phe He Asp Glu Cys Phe Met Val His Ala Gly Ser Val Leu Ala 
2340 2345 2350 
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Cys He Glu Phe Thr Arg Cys His Lys Val Met He Phe Gly Asp Ser 
2355 2360 2365 

Arg Gin He His Tyr He Glu Arg Asn Glu Leu Asp Lys Cys Leu Tyr 
2370 2375 2380 

Gly Asp Leu Asp Arg Phe Val Asp Leu Gin Cys Arg Val Tyr Gly Asn 
2385 2390 2395 2400 

He Ser Tyr Arg Cys Pro Trp Asp Val Cys Ala Trp Leu Ser Thr Val 
2405 2410 2415 

Tyr Gly Asn Leu He Ala Thr Val Lys Gly Glu Ser Glu Gly Lys Ser 
2420 2425 2430 

Ser Met Arg He Asn Glu He Asn Ser Val Asp Asp Leu Val Pro Asp 
2435 2440 2445 

Val Gly Ser Thr Phe Leu Cys Met Leu Gin Ser Glu Lys Leu Glu He 
2450 2455 2460 

Ser Lys His Phe He Arg Lys Gly Leu Thr Lys Leu Asn Val Leu Thr 
2465 2470 2475 2480 

Val His Glu Ala Gin Gly Glu Thr Tyr Ala Arg Val Asn Leu Val Arg 
2485 2490 2495 

Leu Lys Phe Gin Glu Asp Glu Pro Phe Lys Ser He Arg His He Thr 
2500 2505 2510 

Val Ala Leu Ser Arg His Thr Asp Ser Leu Thr Tyr Asn Val Leu Ala 
2515 2520 2525 

Ala Arg Arg Gly Asp Ala Thr Cys Asp Ala He Gin Lys Ala Ala Glu 
2530 2535 2540 

Leu Val Asn Lys Phe Arg Val Phe Pro Thr Ser Phe Gly Gly Ser Val 
2545 2550 2555 2560 

He Asn Leu Asn Val Lys Lys Asp Val Glu Asp Asn Ser Arg Cys Lys 
2565 2570 2575 

Ala Ser Ser Ala Pro Leu Ser Val He Asn Asp Phe Leu Asn Glu Val 
2580 2585 2590 

Asn Pro Gly Thr Ala Val He Asp Phe Gly Asp Leu Ser Ala Asp Phe 
2595 2600 2605 

Ser Thr Gly Pro Phe Glu Cys Gly Ala Ser Gly He Val Val Arg Asp 
2610 2615 2620 

Asn He Ser Ser Ser Asn He Thr Asp His Asp Lys Gin Arg Val 
2625 2630 2635 



(2) INFORMATION FOR SEQ ID NO : 4 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1380 base pairs 

(B) TYPE: nucleic acid 

(C) STRAlvrDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

AGCGTAGTTC GGTCGCAGGC GATTCCGCGT AGTUIAACCTT CTCTACAAGA AAATTTGTAT 6 0 

TCGTTTGAAG CGCGGAATTA TAACTTCTCG ACTTGCGACC GTAACACATC TGCTTCAATG 12 0 

TTCGGAGAGG CTATGGCGAT GAACTGTCTT CGTCGTTGCT TCGACCTAGA TGCCTTTTCG 18 0 

TCCCTGCGTG ATGATGTGAT TAGTATCACA CGTTCAGGCA TCGAACAATG GCTGGAGAAA 24 0 

CGTACTCCTA GTCAGATTAA AGCATTAATG AAGGATGTTG AATCGCCTTT GGAAATTGAC 30 0 

GATGAAATTT GTCGTTTTAA GTTGATGGTG AAGCGTGACG CTAAGGTGAA GTTAGACTCT 36 0 

TCTTGTTTAA CTAAACACAG CGCCGCTC;^ AATATCATGT TTCATCGCAA GAGCATTAAT 42 0 

GCTATCTTCT CTCCTATCTT TAATGAGGTG AAAAACCGAA TAATGTGCTG TCTTAAGCCT 4 80 

AACATAAAGT TTTTTACGGA GATGACTAAC AGGGATTTTG CTTCTGTTGT CAGCAACATG 54 0 

CTTGGTGACG ACGATGTGTA CCATATAGGT GAAGTTGATT TCTCAAAGTA CGACAAGTCT 6 00 

CAAGATGCTT TCGTGAAGGC TTTTGAAGAA GTAATGTATA AGGAACTCGG TGTTGATGAA 66 0 

GAGTTGCTGG CTATCTGGAT GTGCGGCGAG CGGTTATCGA TAGCTAACAC TCTCGATGGT 72 0 

CAGTTGTCCT TCACGATCGA GAATCAAAGG AAGTCGGGAG CTTCGAACAC TTGGATTGGT 7 80 

AACTCTCTCG TCACTTTGGG TATTTTAAGT CTTTACTACG ACGTTAGAAA TTTCGAGGCG 84 0 

TTGTACATCT CGGGCGATGA TTCTTTAATT TTTTCTCGCA GCGAGATTTC GAATTATGCC 90 0 

GACGACATAT GCACTGACAT GGGTTTTGAG ACAAAATTTA TGTCCCCAAG TGTCCCGTAC 96 0 

TTTTGTTCTA AATTTGTTGT TATGTGTGGT CATAAGACGT TTTTTGTTCC CGACCCGTAC 102 0 

AAGCTTTTTG TCAAGTTGGG AGCAGTCAAA GAGGATGTTT CAATGGATTT CCTTTTCGAG 108 0 

ACTTTTACCT CCTTTAAAGA CTTAACCTCC GATTTTAACG ACGAGCGCTT AATTCTlAAAG 114 0 

CTCGCTGAAC TTGTGGCTTT AAAATATGAG GTTCAAACCG GCAACACCAC CTTGGCGTTA 12 0 0 

AGTGTGATAC ATTGTTTGCG TTCGAATTTC CTCTCGTTTA GCAAGTTATA TCCTCGCGTG 126 0 

AAGGGATGGC AGGTTTTTTA CACGTCGGTT AAGAT^GCGC TTCTCAAGAG TGGGTGTTCT 13 2 0 

CTCTTCGACA GTTTCATGAC CCCTTTTGGT CAGGCTGTCA TGGTTTGGGA TGATGAGTAG 13 8 0 



wo 98/53055 



- 105- 



PCT/US98/10313 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 459 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Ser Val Val Arg Ser Gin Ala He Pro Arg Arg Lys Pro Ser Leu Gin 
15 10 15 

Glu Asn Leu Tyr Ser Phe Glu Ala Arg Asn Tyr Asn Phe Ser Thr Cys 
20 25 30 

Asp Arg Asn Thr Ser Ala Ser Met Phe Gly Glu Ala Met Ala Met Asn 
35 40 45 

Cys Leu Arg Arg Cys Phe Asp Leu Asp Ala Phe Ser Ser Leu Arg Asp 
50 55 60 

Asp Val He Ser He Thr Arg Ser Gly He Glu Gin Trp Leu Glu Lys 



65 



70 75 80 



Arq Thr Pro Ser Gin He Lys Ala Leu Met Lys Asp Val Glu Ser Pro 
85 90 95 

Leu Glu He Asp Asp Glu He Cys Arg Phe Lys Leu Met Val Lys Arg 
100 105 110 

Asp Ala Lys Val Lys Leu Asp Ser Ser Cys Leu Thr Lys His Ser Ala 
115 120 125 

Ala Gin Asn He Met Phe His Arg Lys Ser He Asn Ala He Phe Ser 
130 135 140 

Pro He Phe Asn Glu Val Lys Asn Arg He Met Cys Cys Leu Lys Pro 
145 150 155 160 

Asn He Lys Phe Phe Thr Glu Met Thr Asn Arg Asp Phe Ala Ser Val 
165 170 175 

Val Ser Asn Met Leu Gly Asp Asp Asp Val Tyr His He Gly Glu Val 
180 185 190 

Asp Phe Ser Lys Tyr Asp Lys Ser Gin Asp Ala Phe Val Lys Ala Phe 
195 200 205 

Glu Glu Val Met Tyr Lys Glu Leu Gly Val Asp Glu Glu Leu Leu Ala 
210 215 220 

He Trp Met Cys Gly Glu Arg Leu Ser He Ala Asn Thr Leu Asp Gly 
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225 230 235 



240 



Gin Leu Ser Phe Thr He Glu Asn Gin Arg Lys Ser Gly Ala Ser Asn 
245 250 255 

Thr Trp He Gly Asn Ser Leu Val Thr Leu Gly He Leu Ser Leu Tyr 
260 265 270 

Tyr Asp Val Arg Asn Phe Glu Ala Leu Tyr He Ser Gly Asp Asp Ser 
275 280 285 

Leu He Phe Ser Arg Ser Glu He Ser Asn Tyr Ala Asp Asp He Cys 
290 295 300 

Thr Asp Met Gly Phe Glu Thr Lys Phe Met Ser Pro Ser Val Pro Tyr 

310 315 320 

Phe Cys Ser Lys Phe Val Val Met Cys Gly His Lys Thr Phe Phe Val 
325 330 335 

Pro Asp Pro Tyr Lys Leu Phe Val Lys Leu Gly Ala Val Lys Glu Asp 
340 345 350 

Val Ser Met Asp Phe Leu Phe Glu Thr Phe Thr Ser Phe Lys Asp Leu 
355 360 365 

Thr Ser Asp Phe Asn Asp Glu Arg Leu He Gin Lys Leu Ala Glu Leu 
370 375 380 

Val Ala Leu Lys Tyr Glu Val Gin Thr Gly Asn Thr Thr Leu Ala Leu 
385 390 395 400 

Ser Val He His Cys Leu Arg Ser Asn Phe Leu Ser Phe Ser Lys Leu 
405 410 415 

Tyr Pro Arg Val Lys Gly Trp Gin Val Phe Tyr Thr Ser Val Lys Lys 
420 425 430 

Ala Leu Leu Lys Ser Gly Cys Ser Leu Phe Asp Ser Phe Met Thr Pro 
435 440 445 

Phe Gly Gin Ala Val Met Val Trp Asp Asp Glu 
450 455 

) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
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ATGAATCAGG TTTTGCAGTT TGAATGTTTG TTTCTGCTGA ATCTCGCGGT TTTTGCTGTG 6 0 

ACTTTCATTT TCATTCTTCT GGTCTTCCGC GTGATTAAGT CTTTTCGCCA GAAGGGTCAC 12 0 

GAAGCACCTG TTCCCGTTGT TCGTGGCGGG GGTTTTTCAA CCGTAGTGTA G 171 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Asn Gin Val Leu Gin Phe Glu Cys Leu Phe Leu Leu Asn Leu Ala 
1 5 10 15 

Val Phe Ala Val Thr Phe He Phe He Leu Leu Val Phe Arg Val He 
20 25 30 

Lys Ser Phe Arg Gin Lys Gly His Glu Ala Pro Val Pro Val Val Arg 
35 40 45 

Gly Gly Gly Phe Ser Thr Val Val 
50 55 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

ATGGTAGTTT TCGGTTTGGA CTTTGGCACC ACATTCTCTA CGGTGTGTGT GTACAAGGAT 6 0 

GGACGAGTTT TTTCATTCAA GCAGAATAAT TCGGCGTACA TCCCCACTTA CCTCTATCTC 120 

TTCTCCGATT CTAACCACAT GACTTTTGGT TACGAGGCCG AATCACTGAT GAGTAATCTG ISO 

AAAGTTAAAG GTTCGTTTTA TAGAGATTTA AAACGTTGGG TGGGTTGCGA TTCGAGTAAC 24 0 

CTCGACGCGT ACCTTGACCG TTTAAAACCT CATTACTCGG TCCGCTTGGT TAAGATCGGC 3 00 

TCTGGCTTGA ACGAAACTGT TTCAATTGGA AACTTCGGGG GCACTGTTAA GTCTGAGGCT 360 

CATCTGCCAG GGTTGATAGC TCTCTTTATT AAGGCTGTCA TTAGTTGCGC GGAGGGCGCG 42 0 
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TTTGCGTGCA CTTGCACCGG GGTTATTTGT TCAGTACCTG CCAATTATGA TAGCGTTCAA 4 80 

AGGAATTTCA CTGATCAGTG TGTTTCACTC AGCGGTTATC AGTGCGTATA TATGATCAAT 54 0 

GAACCTTCAG CGGCTGCGCT ATCTGCGTGT AATTCGATTG GAAAGAAGTC CGCAAATTTG 6 00 

GCTGTTTACG ATTTCGGTGG TGGGACCTTC GACGTGTCTA TCATTTCATA CCGCAACAAT 66 0 

ACTTTTGTTG TGCGAGCTTC TGGAGGCGAT CTAAATCTCG GTGGAAGGGA TGTTGATCGT 72 0 

GCGTTTCTCA CGCACCTCTT CTCTTTAACA TCGCTGGAAC CTGACCTCAC TTTGGATATC 78 0 

TCGAATCTGA AAGAATCTTT ATCAAAAACG GACGCAGAGA TAGTTTACAC TTTGAGAGGT 84 0 

GTCGATGGAA GAAAAGAAGA CGTTAGAGTA AACAAAAACA TTCTTACGTC GGTGATGCTC 900 

CCCTACGTGA ACAGAACGCT TAAGATATTA GAGTCAACCT TAAAATCGTA TGCTAAGAGT 96 0 

ATGAATGAGA GTGCGCGAGT TAAGTGCGAT TTAGTGCTGA TAGGAGGATC TTCATATCTT 102 0 

CCTGGCCTGG CAGACGTACT AACGAAGCAT CAGAGCGTTG ATCGTATCTT AAGAGTTTCG 108 0 

GATCCTCGGG CTGCCGTGGC CGTCGGTTGC GCATTATATT CTTCATGCCT CTCAGGATCT 114 0 

GGGGGGTTGC TACTGATCGA CTGTGCAGCT CACACTGTCG CTATAGCGGA CAGAAGTTGT 12 0 0 

CATCAAATCA TTTGCGCTCC AGCGGGGGCA CCGATCCCCT TTTCAGGAAG CATGCCTTTG 126 0 

TACTTAGCCA GGGTCAACAA GAACTCGCAG CGTGAAGTCG CCGTGTTTGA AGGGGAGTAC 132 0 

GTTAAGTGCC CTAAGAACAG AAAGATCTGT GGAGCAAATA TAAGATTTTT TGATATAGGA 13 8 0 

GTGACGGGTG ATTCGTACGC ACCCGTTACC TTCTATATGG ATTTCTCCAT TTCAAGCGTA 144 0 

GGAGCCGTTT CATTCGTGGT GAGAGGTCCT GAGGGTAAGC AAGTGTCACT CACTGGAACT 150 0 

CCAGCGTATA ACTTTTCGTC TGTGGCTCTC GGATCACGCA GTGTCCGAGA ATTGCATATT 156 0 

AGTTTAAATA ATTUU^GTTTT TCTCGGTTTG CTTCTACATA GAAAGGCGGA TCGACGAATA 162 0 

CTTTTCACTA AGGATGAAGC GATTCGATAC GCCGATTCAA TTGATATCGC GGATGTGCTA 16 8 0 

AAGGAATATA AAAGTTACGC GGCCAGTGCC TTACCACCAG ACGAGGATGT CGAATTACTC 174 0 

CTGGGAAAGT CTGTTCAAAA AGTTTTACGG GGAAGCAGAC TGGAAGAAAT ACCTCTCTAG 18 00 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



wo 98/53055 



- 109- 



PCT/US98/10313 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



Me 

1 



t val val Phe Gly Leu Asp Phe Gly Thr Thr Phe Ser Thr Val Cys 
5 10 15 . 



Val Tyr Lys Asp Gly Arg Val Phe Ser Phe Lys Gin Asn Asn Ser Ala 
20 25 30 

Tyr lie Pro Thr Tyr Leu Tyr Leu Phe Ser Asp Ser Asn His Met Thr 
35 40 45 

Phe Gly Tyr Glu Ala Glu Ser Leu Met Ser Asn Leu Lys Val Lys Gly 
50 55 60 



ser Phe Tyr Arg Asp Leu Lys Arg Trp Val Gly Cys Asp Ser Ser Asn 
65 



70 75 80 



Leu Asp Ala Tyr Leu Asp Arg Leu Lys Pro His Tyr Ser Val Arg Leu 
85 90 5^ 

Val Lys lie Gly Ser Gly Leu Asn Glu Thr Val Ser He Gly Asn Phe 
100 105 110 

Gly Gly Thr Val Lys Ser Glu Ala His Leu Pro Gly Leu He Ala Leu 
1X5 120 125 

Phe He Lys Ala Val He Ser Cys Ala Glu Gly Ala Phe Ala Cys Thr 
130 135 140 

CVS Thr Gly Val He Cys Ser Val Pro Ala Asn Tyr Asp Ser Val Gin 
145 150 155 160 

Arg Asn Phe Thr Asp Gin Cys Val Ser Leu Ser Gly Tyr Gin Cys Val 
165 1-70 175 

Tvr Met He Asn Glu Pro Ser Ala Ala Ala Leu Ser Ala Cys Asn Ser 
180 185 190 

He Gly Lys Lys Ser Ala Asn Leu Ala Val Tyr Asp Phe Gly Gly Gly 
195 200 205 

Thr Phe Asp Val Ser He He Ser Tyr Arg Asn Asn Thr Phe Val Val 
210 215 220 

Arg Ala Ser Gly Gly Asp Leu Asn Leu Gly Gly Arg Asp Val Asp Arg 



225 



230 235 240 



Ala Phe Leu Thr His Leu Phe Ser Leu Thr Ser Leu Glu Pro Asp Leu 
245 250 255 

Thr Leu Asp He Ser Asn Leu Lys Glu Ser Leu Ser Lys Thr Asp Ala 
260 265 270 

Glu He Val Tyr Thr Leu Arg Gly Val Asp Gly Arg Lys Glu Asp Val 
275 280 285 
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Arg Val Asn Lys Asn lie Leu Thr Ser Val Met Leu Pro Tyr Val Asn 
290 295 300 

Arg Thr Leu Lys He Leu Glu Ser Thr Leu Lys Ser Tyr Ala Lys Ser 
305 310 315 320 

Met Asn Glu Ser Ala Arg Val Lys Cys Asp Leu Val Leu He Gly Gly 
^25 330 335 

Ser Ser Tyr Leu Pro Gly Leu Ala Asp Val Leu Thr Lys His Gin Ser 

345 

Val Asp Arg He Leu Arg Val Ser Asp Pro Arg Ala Ala Val Ala Val 
355 360 365 

Gly Cys Ala Leu Tyr Ser Ser Cys Leu Ser Gly Ser Gly Gly Leu Leu 
370 375 380 

Leu He Asp Cys Ala Ala His Thr Val Ala He Ala Asp Arg Ser Cyc 
385 390 395 4OO 

His Gin He He Cys Ala Pro Ala Gly Ala Pro He Pro Phe Ser Gly 
405 410 415 

Ser Met Pro Leu Tyr Leu Ala Arg Val Asn Lys Asn Ser Gin Arg Glu 
420 425 430 

Val Ala Val Phe Glu Gly Glu Tyr Val Lys Cys Pro Lys Asn Arg Lys 
435 440 445 

He Cys Gly Ala Asn He Arg Phe Phe Asp He Gly Val Thr Gly Asp 
450 455 460 

Ser Tyr Ala Pro Val Thr Phe Tyr Met Asp Phe Ser He Ser Ser Val 
465 470 475 48O 

Gly Ala Val Ser Phe Val Val Arg Gly Pro Glu Gly Lys Gin Val Ser 
485 490 495 

Leu Thr Gly Thr Pro Ala Tyr Asn Phe Ser Ser Val Ala Leu Gly Ser 
500 505 510 

Arg Ser Val Arg Glu Leu His He Ser Leu Asn Asn Lys Val Phe Leu 
515 520 525 

Gly Leu Leu Leu His Arg Lys Ala Asp Arg Arg He Leu Phe Thr Lys 
530 535 540 

Asp Glu Ala He Arg Tyr Ala Asp Ser He Asp He Ala Asp Val Leu 
545 550 555 56O 

Lys Glu Tyr Lys Ser Tyr Ala Ala Ser Ala Leu Pro Pro Asp Glu Asp 
565 570 575 

Val Glu Leu Leu Leu Gly Lys Ser Val Gin Lys Val Leu Arg Gly Ser 
580 585 590 
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Arg Leu Glu Glu lie Pro Leu 
595 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1656 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATGTCGAATT ACTCCTGGGA AAGTCTGTTC AAAAAGTTTT ACGGGGAAGC AGACTGGAAG 6 0 

AAATACCTCT CTAGGAGCAT AGCAGCACAC TCAAGTGAAA TTAAAACTCT ACCAGACATT 12 0 

CGATTGTACG GCGGTAGGGT TGTAAAGAAG TCCGAATTCG AATCAGCACT TCCTAATTCT 180 

TTTGAACAGG AATTAGGACT GTTCATACTG AGCGAACGGG AAGTGGGATG GAGCAAATTA 24 0 

TGCGGAATAA CGGTGGAAGA AGCAGCATAC GATCTTACGA ATCCCAAGGC TTATAAATTC 3 00 

ACTGCCGAGA CATGTAGCCC GGATGTAAAA GGTGAAGGAC AAAAATACTC TATGGAAGAC 360 

GTGATGAATT TCATGCGTTT ATCAAATCTG GATGTTAACG ACAAGATGCT GACGGAACAG 420 

TGTTGGTCGC TGTCCAATTC ATGCGGTGAA TTGATCAACC CAGACGACAA AGGGCGATTC 4 80 

GTGGCTCTCA CCTTTAAGGA CAGAGACACA GCTGATGACA CGGGTGCCGC CAACGTGGAA 54 0 

TGTCGCGTGG GCGACTATCT AGTTTACGCT ATGTCCCTGT TTGAGCAGAG GACCCAAAAA 6 00 

TCGCAGTCTG GCAACATCTC TCTGTACGAA AAGTACTGTG AATACATCAG GACCTACTTA 66 0 

GGGAGTACAG ACCTGTTCTT CACAGCGCCG GACAGGATTC CGTTACTTAC GGGCATCCTA 72 0 

TACGATTTTT GTAAGGAATA CAACGTTTTC TACTCGTCAT ATAAGAGAAA CGTCGATAAT 780 

TTCAGATTCT TCTTGGCGAA TTATATGCCT TTGATATCTG ACGTCTTTGT CTTCCAGTGG 84 0 

GTAAAACCCG CGCCGGATGT TCGGCTGCTT TTTGAGTTAA GTGCAGCGGA ACTAACGCTG 900 

GAGGTTCCCA CACTGAGTTT GATAGATTCT CAAGTTGTGG TAGGTCATAT CTTAAGATAC 96 0 

GTAGAATCCT ACACATCAGA TCCAGCCATC GACGCGTTAG AAGACAAACT GGAAGCGATA 102 0 

CTGAAAAGTA GCAATCCCCG TCTATCGACA GCGCAACTAT GGGTTGGTTT CTTTTGTTAC 1080 

TATGGTGAGT TTCGTACGGC TCAAAGTAGA GTAGTGCAAA GACCAGGCGT ATACAAAACA 114 0 

CCTGACTCAG TGGGTGGATT TGAAATAAAC ATGAAAGATG TTGAGAAATT CTTCGATAAA 1200 



1656 
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CTTCAGAGAG AATTGCCTAA TGTATCTTTG CGGCGTCAGT TTAACGGAGC TAGAGCGCAT 126 0 

GAGGCTTTCA AAATATTTAA AAACGGAAAT ATT^AGTTTCA GACCTATATC GCGTTTTUU^C 132 0 

GTGCCTAGAG AGTTCTGGTA TCTGAACATA GACTACTTCA GGCACGCGAA TAGGTCCGGG 13 80 

TTAACCGAAG AAGAAATACT CATCCTAAAC AACATAAGCG TTGATGTTAG GAAGTTATGC 144 0 

GCTGAGAGAG CGTGCAATAC CCTACCTAGC GCGAAGCGCT TTAGTAAAAA TCATAAGAGT 1500 

AATATACAAT CATCACGCCA AGAGCGGAGG ATTAAAGACC CATTGGTAGT CCTGAAAGAC 156 0 

ACTTTATATG AGTTCCAACA CAAGCGTGCC GGTTGGGGGT CTCGAAGCAC TCGAGACCTC 162 0 
GGGAGTCGTG CTGACCACGC GAAAGGAAGC GGTTGA 
(2) INFORMATION FOR SEQ ID NO : 11 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Ser Asn Tyr Ser Trp Glu Ser Leu Phe Lys Lys Phe Tyr Gly Glu 
15 10 15 

Ala Asp Trp Lys Lys Tyr Leu Ser Arg Ser He Ala Ala His Ser Ser 
20 25 30 

Glu He Lys Thr Leu Pro Asp He Arg Leu Tyr Gly Gly Arg Val Val 
35 40 45 

Lys Lys Ser Glu Phe Glu Ser Ala Leu Pro Asn Ser Phe Glu Gin Glu 
50 55 60 

Leu Gly Leu Phe He Leu Ser Glu Arg Glu Val Gly Trp Ser Lys Leu 
^5 70 75 80 

Cys Gly He Thr Val Glu Glu Ala Ala Tyr Asp Leu Thr Asn Pro Lys 
85 90 95 

Ala Tyr Lys Phe Thr Ala Glu Thr Cys Ser Pro Asp Val Lys Gly Glu 
100 105 110 

Gly Gin Lys Tyr Ser Met Glu Asp Val Met Asn Phe Met Arg Leu Ser 

120 125 ■ 

Asn Leu Asp Val Asn Asp Lys Met Leu Thr Glu Gin Cys Trp Ser Leu 
130 135 140 

Ser Asn Ser Cys Gly Glu Leu He Asn Pro Asp Asp Lys Gly Arg Phe 
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145 150 



155 160 



Val Ala Leu Thr Phe Lys Asp Arg Asp Thr Ala Asp Asp Thr Gly Ala 
165 170 175 

Ala Asn Val Glu Cys Arg Val Gly Asp Tyr Leu Val Tyr Ala Met Ser 
180 185 190 

Leu Phe Glu Gin Arg Thr Gin Lys Ser Gin Ser Gly Asn lie Ser Leu 
195 200 205 

Tyr Glu Lys Tyr Cys Glu Tyr He Arg Thr Tyr Leu Gly Ser Thr Asp 
210 215 220 

Leu Phe Phe Thr Ala Pro Asp Arg He Pro Leu Leu Thr Gly He Leu 
225 230 235 240 

Tyr Asp Phe Cys Lys Glu Tyr Asn Val Phe Tyr Ser Ser Tyr Lys Arg 
245 250 255 

Asn val Asp Asn Phe Arg Phe Phe Leu Ala Asn Tyr Met Pro Leu He 
260 265 270 

Ser Asp Val Phe Val Phe Gin Trp Val Lys Pro Ala Pro Asp Val Arg 
275 280 285 

Leu Leu Phe Glu Leu Ser Ala Ala Glu Leu Thr Leu Glu Val Pro Thr 
290 295 300 

Leu ser Leu He Asp Ser Gin Val Val Val Gly His He Leu Arg Tyr 
305 310 ' 315 320 

Val Glu Ser Tyr Thr Ser Asp Pro Ala He Asp Ala Leu Glu Asp Lys 
325 330 335 

Leu Glu Ala He Leu Lys Ser Ser Asn Pro Arg Leu Ser Thr Ala Gin 
340 345 350 

Leu Trp Val Gly Phe Phe Cys Tyr Tyr Gly Glu Phe Arg Thr Ala Gin 
355 360 365 

Ser Arg Val Val Gin Arg Pro Gly Val Tyr Lys Thr Pro Asp Ser Val 
370 375 380 

Gly Gly Phe Glu He Asn Met Lys Asp Val Glu Lys Phe Phe Asp Lys 
385 390 395 400 

Leu Gin Arg Glu Leu Pro Asn Val Ser Leu Arg Arg Gin Phe Asn Gly 
405 410 415 

Ala Arg Ala His Glu Ala Phe Lys He Phe Lys Asn Gly Asn He Ser 
420 425 430 

Phe Arg Pro He Ser Arg Leu Asn Val Pro Arg Glu Phe Trp Tyr Leu 
435 440 445 

Asn He Asp Tyr Phe Arg His Ala Asn Arg Ser Gly Leu Thr Glu Glu 
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450 455 460 

Glu He Leu He Leu Asn Asn He Ser Val Asp Val Arg Lys Leu Cys 
465 470 475 480 

Ala Glu Arg Ala Cys Asn Thr Leu Pro Ser Ala Lys Arg Phe Ser Lys 
485 490 495 

Asn His Lys Ser Asn He Gin Ser Ser Arg Gin Glu Arg Arg He Lys 
500 505 510 

Asp Pro Leu Val Val Leu Lys Asp Thr Leu Tyr Glu Phe Gin His Lys 
515 520 525 

Arg Ala Gly Trp Gly Ser Arg Ser Thr Arg Asp Leu Gly Ser Arg Ala 
530 535 540 

Asp His Ala Lys Gly Ser Gly 
545 550 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 672 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

ATGAGTTCCA ACACAAGCGT GCCGGTTGGG GGTCTCGAAG CACTCGAGAC CTCGGGAGTC 6 0 

GTGCTGACCA CGCGAAAGGA AGCGGTTGAT AAGTTTTTTA ATGAACTAAA AAACGAAAAT 12 0 

TACTCATCAG TTGACAGCAG CCGATTAAGC GATTCGGAAG TAAAAGAAGT GTTAGAGAAA 180 

AGTAAAGAAA GTTTCAAAAG CGAACTGGCC TCCACTGACG AGCACTTCGT CTACCACATT 24 0 

ATATTTTTCT TAATCCGATG TGCTAAGATA TCGACAAGTG AAAAGGTGAA GTACGTTGGT 3 00 

AGTCATACGT ACGTGGTCGA CGGAAAAACG TACACCGTTC TTGACGCTTG GGTATTCAAC 36 0 

ATGATGAAAA GTCTCACGAA GAAGTACAAA CGAGTGAATG GTCTGCGTGC GTTCTGTTGC 42 0 

GCGTGCGAAG ATCTATATCT AACCGTCGCA CCAATAATGT CAGAACGCTT TAAGACTAAA 480 

GCCGTAGGGA TGAAAGGTTT GCCTGTTGGA AAGGAATACT TAGGCGCCGA CTTTCTTTCG 54 0 

GGAACTAGCA AACTGATGAG CGATCACGAC AGGGCGGTCT CCATCGTTGC AGCGAAAAAC 6 00 

GCTGTCGATC GTAGCGCTTT CACGGGTGGG GAGAGAAAGA TAGTTAGTTT GTATGATCTA 66 0 
GGGAGGTACT AA 

672 
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(2) INFORMATION FOR SEQ ID N0:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Ser Ser Asn Thr Ser Val Pro Val Gly Gly Leu Glu Ala Leu Glu 
1 S 10 15 

Thr Ser Gly Val Val Leu Thr Thr Arg Lys Glu Ala Val Asp Lys Phe 
20 25 30 

Phe Asn Glu Leu Lys Asn Glu Asn Tyr Ser Ser Val Asp Ser Ser Arg 
35 40 45 

Leu Ser Asp Ser Glu Val Lys Glu Val Leu Glu Lys Ser Lys Glu Ser 
50 55 60 

Phe Lys Ser Glu Leu Ala Ser Thr Asp Glu His Phe Val Tyr His He 
65 70 75 80 

He Phe Phe Leu He Arg Cys Ala Lys He Ser Thr Ser Glu Lys Val 
85 90 95 

Lys Tyr Val Gly Ser His Thr Tyr Val Val Asp Gly Lys Thr Tyr Thr 
100 105 110 

Val Leu Asp Ala Trp Val Phe Asn Met Met Lys Ser Leu Thr Lys Lys 
115 120 125 

Tyr Lys Arg Val Asn Gly Leu Arg Ala Phe Cys Cys Ala Cys Glu Asp 
130 135 140 

Leu Tyr Leu Thr Val Ala Pro He Met Ser Glu Arg Phe Lys Thr Lys 
145 150 155 160 

Ala Val Gly Met Lys Gly Leu Pro Val Gly Lys Glu Tyr Leu Gly Ala 
165 170 175 

Asp Phe Leu Ser Gly Thr Ser Lys Leu Met Ser Asp His Asp Arg Ala 
180 185 190 

Val Ser He Val Ala Ala Lys Asn Ala Val Asp Arg Ser Ala Phe Thr 
195 200 205 

Gly Gly Glu Arg Lys He Val Ser Leu Tyr Asp Leu Gly Arg Tyr 
210 215 220 

(2) INFORMATION FOR SEQ ID NO:14: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATGGAGTTGA TGTCCGACAG 
AATGGTGTCG ACAAGAAGCT 
GCTCCTAACG AGGGTATAGA 
ACCACGTCTC CTAAGGTTCA 
GAGAGGAATG TGGTAGTAAC 
CTCACTAGGT TCACTAATAA 
GACTTTTGTA TCGCGTATAA 
ATTCCAGCTG AAGATTCGTA 
GAATTACAGC AAAGTAGGAA 
GTGGTAAATA CACCAGTGAG 
(2) INFORMATION FOR SEQ ID NO: 15: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Glu Leu Met Ser Asp Ser Asn Leu Ser Asn Leu Val He Thr Asp 
15 10 15 

Ala Ser Ser Leu Asn Gly Val Asp Lys Lys Leu Leu Ser Ala Glu Val 
20 25 30 

Glu Lys Met Leu Val Gin Lys Gly Ala Pro Asn Glu Gly He Glu Val 
35 40 45 

Val Phe Gly Leu Leu Leu Tyr Ala Leu Ala Ala Arg Thr Thr Ser Pro 
50 55 60 

Lys Val Gin Arg Ala Asp Ser Asp Val He Phe Ser Asn Ser Phe Gly 



CAACCTTAGC AACCTGGTGA TAACCGACGC CTCTAGTCTA 6 0 

TTTATCTGCT GAAGTTGAAA AAATGTTGGT GCAGAAAGGG 12 0 

AGTGGTGTTC GGTCTACTCC TTTACGCACT CGCGGCAAGA 18 0 

GCGCGCAGAT TCAGACGTTA TATTTTCAAA TAGTTTCGGA 24 0 

AGAGGGTGAC CTTAAGAAGG TACTCGACGG GTGTGCGCCT 3 00 

ACTTAGAACG TTCGGTCGTA CTTTCACTGA GGCTTACGTT 36 0 

GCACAAATTA CCCCAACTCA ACGCCGCGGC GGAATTGGGG 42 0 

CTTAGCTGCA GATTTTCTGG GTACTTGCCC GAAGCTCTCT 48 0 

GATGTTCGCG AGTATGTACG CTCTAAAAAC TGAAGGTGGA 54 0 

CAATCTGCGT CAGCTAGGTA GAAGGGAAGT TATGTAA 597 
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Glu Arg Asn Val Val Val Thr Glu Gly Asp Leu Lys Lys Val Leu Asp 
85 90 95 

Gly Cys Ala Pro Leu Thr Arg Phe Thr Asn Lys Leu Arg Thr Phe Gly 
100 105 110 

Arg Thr Phe Thr Glu Ala Tyr Val Asp Phe Cys He Ala Tyr Lys His 
115 120 125 

Lys Leu Pro Gin Leu Asn Ala Ala Ala Glu Leu Gly He Pro Ala Glu 
130 135 140 

Asp Ser Tyr Leu Ala Ala Asp Phe Leu Gly Thr Cys Pro Lys Leu Ser 



145 



150 155 160 



Glu Leu Gin Gin Ser Arg Lys Met Phe Ala Ser Met Tyr Ala Leu Lys 
165. l^O 175 

Thr Glu Gly Gly Val Val Asn Thr Pro Val Ser Asn Leu Arg Gin Leu 
180 185 190 

Gly Arg Arg Glu Val Met 
195 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ATGGAAGATT ACGAAGAAAA ATCCGAATCG CTCATACTGC TACGCACGAA TCTGAACACT 

ATGCTTTTAG TGGTCAAGTC CGATGCTAGT GTAGAGCTGC CTAAACTACT AATTTGCGGT 12 0 

TACTTACGAG TGTCAGGACG TGGGGAGGTG ACGTGTTGCA ACCGTGAGGA ATTAACAAGA 180 

GATTTTGAGG GCAATCATCA TACGGTGATC CGTTCTAGAA TCATACAATA TGACAGCGAG 24 0 

TCTGCTTTTG AGGAATTCAA CAACTCTGAT TGCGTAGTGA AGTTTTTCCT AGAGACTGGT 3 00 

AGTGTCTTTT GGTTTTTCCT TCGAAGTGAA ACCAAAGGTA GAGCGGTGCG ACATTTGCGC 3 60 

ACCTTCTTCG AAGCTAACAA TTTCTTCTTT GGATCGCATT GCGGTACCAT GGAGTATTGT 420 

TTGAAGCAGG TACTAACTGA AACTGAATCT ATAATCGATT CTTTTTGCGA AGAAAGAAAT 4 80 

CGTTAA 



60 



486 



wo 98/53055 



-118- 



PCTAJS98/10313 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Glu Asp Tyr Glu Glu Lys Ser Glu Ser Leu lie Leu Leu Arg Thr 
IS 10 15 

Asn Leu Asn Thr Met Leu Leu Val Val Lys Ser Asp Ala Ser Val Glu 

20 25 30 

Leu Pro Lys Leu Leu lie Cys Gly Tyr Leu Arg Val Ser Gly Arg Gly 



Glu Val Thr Cys Cys Asn Arg Glu Glu Leu Thr Arg Asp Phe Glu Gly 
50 55 60 

Asn His His Thr Val He Arg Ser Arg He He Gin Tyr Asp Ser Glu 
65 70 75 80 

Ser Ala Phe Glu Glu Phe Asn Asn Ser Asp Cys Val Val Lys Phe Phe 
85 90 95 

Leu Glu Thr Gly Ser Val Phe Trp Phe Phe Leu Arg Ser Glu Thr Lys 
100 105 110 

Gly Arg Ala Val Arg His Leu Arg Thr Phe Phe Glu Ala Asn Asn Phe 
115 120 125 

Phe Phe Gly Ser His Cys Gly Thr Met Glu Tyr Cys Leu Lys Gin Val 
130 135 140 

Leu Thr Glu Thr Glu Ser He He Asp Ser Phe Cys Glu Glu Arg Asn 
145 150 155 160 

Arg 



2) INFORMATION FOR SEQ ID NO: 18: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 618 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



618 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ATGAGGGTTA TAGTGTCTCC TTATGAAGCT GAAGACATTC TGAAAAGATC GACTGACATG 6 0 

TTACGAAACA TAGACAGTGG GGTCTTGAGC ACTAAAGAAT GTATCAAGGC ATTCTCGACG 120 

ATAACGCGAG ACCTACATTG TGCGAAGGCT TCCTACCAGT GGGGTGTTGA CACTGGGTTA 180 

TATCAGCGTA ATTGCGCTGA AAAACGTTTA ATTGACACGG TGGAGTCAAA CATACGGTTG 24 0 

GCTCAACCTC TCGTGCGTGA AAAAGTGGCG GTTCATTTTT GTAAGGATGA ACCAAAAGAG 3 00 

CTAGTAGCAT TCATCACGCG AAAGTACGTG GAACTCACGG GCGTGGGAGT GAGAGAAGCG 360 

GTGAAGAGGG AAATGCGCTC TCTTACCAAA ACAGTTTTAA ATAAAATGTC TTTGGAAATG 42 0 

GCGTTTTACA TGTCACCACG AGCGTGGAAA AACGCTGAAT GGTTAGAACT AAAATTTTCA 48 0 

CCTGTGAAAA TCTTTAGAGA TCTGCTATTA GACGTGGAAA CGCTCAACGA ATTGTGCGCC 54 0 

GAAGATGATG TTCACGTCGA CAAAGTAAAT GAGAATGGGG ACGAAAATCA CGACCTCGAA 6 00 
CTCCAAGACG AATGTTAA 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Arg Val He Val Ser Pro Tyr Glu Ala Glu Asp He Leu Lys Arg 
1 5 10 15 

Ser Thr Asp Met Leu Arg Asn He Asp Ser Gly Val Leu Ser Thr Lys 
20 25 30 

Glu Cys He Lys Ala Phe Ser Thr He Thr Arg Asp Leu His Cys Ala 
35 40 45 

Lvs Ala Ser Tyr Gin Trp Gly Val Asp Thr Gly Leu Tyr Gin Arg Asn 
50 55 60 

CVS Ala Glu Lys Arg Leu He Asp Thr Val Glu Ser Asn He Arg Leu 
65 70 75 80 

Ala Gin Pro Leu Val Arg Glu Lys Val Ala Val His Phe Cys Lys Asp 
85 90 95 

Glu pro Lys Glu Leu Val Ala Phe He Thr Arg Lys Tyr Val Glu Leu 
100 105 110 
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Thr Gly Val Gly 
115 

Thr Lys Thr Val 
130 

Ser Pro Arg Ala 
145 

Pro Val Lys lie 



Glu Leu Cys Ala 
180 

Gly Asp Glu Asn 
195 



Val Arg Glu Ala 
120 

Leu Asn Lys Met 
135 

Trp Lys Asn Ala 
150 

Phe Arg Asp Leu 
165 

Glu Asp Asp Val 



His Asp Leu Glu 
200 



Val Lys Arg Glu 



Ser Leu Glu Met 
14 0 

Glu Trp Leu Glu 
155 

Leu Leu Asp Val 
170 

His Val Asp Lys 
185 

Leu Gin Asp Glu 



Met Arg Ser Leu 
125 

Ala Phe Tyr Met 

Leu Lys Phe Ser 
160 

Glu Thr Leu Asn 
175 

Val Asn Glu Asn 
190 

Cys 
205 



(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TGCTGGAGCT TGAGGTTCTG C 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHJUIACTERISTICS : 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CGGAATTCAC CATGGAGTTG ATGTCCGACA G 
(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
AGCGGATCCA TGGCAGATTC GTGCGTAGCA GTA 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ACATTGGTTA AGTTTAACGA AAATGATTAG TAAATAATAA ATCGAACGTG GGTGTATCTA 6 0 

CCTGACGTAT CAACTTAAGC TGTTACTGAG TAATTAAACC AACAAGTGTT GGTGTAATGT 12 0 

GTATGTTGAT GTAGAGAAAA ATCCGTTTGT AGAACGGTGT TTTTCTCTTC TTTATTTTTA 180 

AAAAAAAAAT AAT^A/VAAAAA AAAAAAAAGC GGCCGC 216 
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WHAT IS CLAIMED: 

L An isolated protein or polypeptide corresponding to a protein or 
polypeptide of a grapevine leafroll virus (type 2). 

2. An isolated protein or polypeptide according to claim 1 , wherein the 
protein or polypeptide is selected from a group consisting of a polyprotein, an RNA- 
dependent RNA polymerase, a heat shock 70 protein, a heat shock 90 protein, a 
diverged coat protein, and a coat protein. 

3. An isolated protein or polypeptide according to claim 2, wherein the 
protein or polypeptide is a polyprotein. 

4. An isolated protein or polypeptide according to claim 3, wherein the 
protein or polypeptide comprises an amino acid sequence corresponding to SEQ. ID. 
No, 3. 

5. An isolated protein or polypeptide according to claim 2, wherein the 
protein or polypeptide is an RNA-dependent RNA polymerase having a molecular 
weight of from about 50 to about 54 kDa. 

6. An isolated protein or polypeptide according to claim 5, wherein the 
protein or polypeptide comprises an amino acid sequence corresponding to SEQ. ID. 
No. 5. 

7. An isolated protein or polypeptide according to claim 2, wherein the 
protein or polypeptide is a heat shock 70 protein having a molecular weight of from 
about 63 to about 67 kDa. 

8. An isolated protein or polypeptide according to claim 7, wherein the 
protein or polypeptide comprises an amino acid sequence corresponding to SEQ. ID. 
No. 9. 

9. An isolated protein or polypeptide according to claim 2, wherein the 
protein or polypeptide is a heat shock 90 protein having a molecular weight of from 
about 61 to about 65 kDa. 
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10. An isolated protein or polypeptide according to claim 9, wherein the 
protein or polypeptide comprises an amino acid sequence corresponding to SEQ. ID. 
No. 11. 

11. An isolated protein or polypeptide according to claim 2, wherein the 
protein or polypeptide is a coat protein having a molecular weight of from about 20 to 
about 24 kDa. 

12. An isolated protein or polypeptide according to claim 1 1, wherein the 
protein or polypeptide comprises an amino acid sequence corresponding to SEQ. ID. 
No. 15. 

13. An isolated protein or polypeptide according to claim 2, wherein the 
protein or polypeptide is a diverged coat protein having a molecular weight of from 
about 23 to about 27 kDa. 

14. An isolated protein or polypeptide according to claim 1 3, wherein the 
protein or polypeptide comprises an amino acid sequence corresponding to SEQ. ID. 
No. 13. 

15. An isolated protein or polypeptide according to claim 1 , wherein the 
protein or polypeptide comprises an amino acid sequence corresponding to SEQ. ID. 
No. 7. 

1 6. An isolated protein or polypeptide according to claim 1 , wherein the 
protein or polypeptide comprises an amino acid sequence corresponding to SEQ. ID. 
No. 17. 

1 7. An isolated protein or polypeptide according to claim 1 , wherein the 
protein or polypeptide comprises an amino acid sequence corresponding to SEQ. ID. 
No. 19. 



1 8. An isolated protein or polypeptide according to claim 1, wherein the 
protein or polypeptide is purified. 
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1 9. An isolated protein or polypeptide according to claim 1 , wherein the 
protein or polypeptide is recombinant. 

20. An isolated RNA molecule encoding a protein or polypeptide according to 
claim 1. 

21 . An isolated RNA molecule according to claim 20, wherein the protein or 
polypeptide is selected from a group consisting of a polyprotein, an RNA-dependent 
RNA polymerase, a heat shock 70 protein, a heat shock 90 protein, a diverged coat 
protein, and a coat protein. 

22. An isolated DNA molecule encoding a protein or polypeptide according to 
claim 1. 

23. An isolated DNA molecule according to claim 22, wherein the protein or 
polypeptide is selected from a group consisting of a polyprotein, an RNA-dependent 
RNA polymerase, a heat shock 70 protein, a heat shock 90 protein, a diverged coat 
protein, and a coat protein. 

24. An isolated DNA molecule according to claim 23, wherein the protein or 
polypeptide is a polyprotein comprising conserved regions of a helicase, a papain-like 
protease, and a methyltransferase. 

25. An isolated DNA molecule according to claim 24, wherein the protein or 
polypeptide comprises an amino acid sequence corresponding to SEQ. ID. No. 3. 

26. An isolated DNA molecule according to claim 25, wherein the DNA 
molecule has a nucleotide sequence corresponding to SEQ. ID. No. 2. 

27. An isolated DNA molecule according to claim 23, wherein the protein or 
polypeptide is an RNA-dependent RNA polymerase having a molecular weight of 
from about 50 to about 54 kDa. 

28. An isolated DNA molecule according to claim 27, wherein the protein or 
polypeptide comprises an amino acid sequence corresponding to SEQ. ID. No. 5. 
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29. An isolated DNA molecule according to claim 28, wherein the DNA 
molecule has a nucleotide sequence corresponding to SEQ. ID. No. 4. 

30. An isolated DNA molecule according to claim 23, wherein the protein or 
polypeptide is a heat shock 70 protein having a molecular weight of from about 63 to 
about 67 kDa. 

31 . An isolated DNA molecule according to claim 30, wherein the protein or 
polypeptide comprises an amino acid sequence corresponding to SEQ. ID. No. 9. 

32. An isolated DNA molecule according to claim 31 , wherein the DNA 
molecule has a nucleotide sequence corresponding to SEQ. ID. No. 8. 

33. An isolated DNA molecule according to claim 23, wherein the protein or 
polypeptide is a heat shock 90 protein having a molecular weight of from about 61 to 
about 65 kDa. 

34. An isolated DNA molecule according to claim 33, wherein the protein or 
polypeptide comprises an amino acid sequence corresponding to SEQ. ID. No. 1 1 . 

35. An isolated DNA molecule according to claim 34, wherein the DNA 
molecule has a nucleotide sequence corresponding to SEQ. ID. No. 10. 

36. An isolated DNA molecule according to claim 23, wherein the protein or 
polypeptide is a coat protein having a molecular weight of from about 20 to about 24 
kDa. 

37. An isolated DNA molecule according to claim 36, wherein the protein or 
polypeptide comprises an amino acid sequence corresponding to SEQ. ID. No. 15. 

38. An isolated DNA molecule according to claim 37, wherein the DNA 
molecule comprises a nucleotide sequence corresponding to SEQ. ID. No. 14. 
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39. An isolated DNA molecule according to claim 23, wherein the protein or 
polypeptide is a diverged coat protein having a molecular weight of from about 23 to 
about 27 kDa. 

40. An isolated DNA molecule according to claim 39, wherein the protein or 
polypeptide comprises an amino acid sequence corresponding to SEQ. ID. No. 13. 

41. An isolated DNA molecule according to claim 40, wherein the DNA 
molecule comprises a nucleotide sequence corresponding to SEQ. ID. No. 12. 

42. An isolated DNA molecule according to claim 22, wherein the protein or 
polypeptide comprises an amino acid sequence corresponding to SEQ. ID, No. 7. 

43. An isolated DNA molecule according to claim 42, wherein the DNA 
molecule comprises a nucleotide sequence corresponding to SEQ. ID. No. 6. 

44. An isolated DNA molecule according to claim 22, wherein the protein or 
polypeptide comprises an amino acid sequence coiTesponding to SEQ. ID. No. 17. 

45. An isolated DNA molecule according to claim 44, wherein the DNA 
molecule comprises a nucleotide sequence corresponding to SEQ. ID. No. 16. 

46. An isolated DNA molecule according to claim 22, wherein the protein or 
polypeptide comprises an amino acid sequence corresponding to SEQ. ID. No. 19. 

47. An isolated DNA molecule according to claim 46, wherein the DNA 
molecule comprises a nucleotide sequence corresponding to SEQ. ID. No. 18. 

48. An isolated DNA molecule according to claim 22, wherein the DNA 
molecule comprises a nucleotide sequence corresponding to SEQ. ID. No. 23. 

49. An expression system comprising an DNA molecule according to claim 22 
in a vector heterologous to the DNA molecule. 

50. An expression system according to claim 49, wherein the protein or 
polypeptide is selected from a group consisting of a polyprotein, an RNA-dependent 
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RNA polymerase, a heat shock 70 protein, a heat shock 90 protein, a diverged coat 
protein, and a coat protein. 

51. A host cell transformed with a heterologous DNA molecule according to 
claim 22. 

52. A host cell according to claim 51 , wherein the host cell is selected from 
the group consisting ofAgrobacterium vitis and Agrobacterium tumefaciem. 

53. A host cell according to claim 51 , wherein the host cell is selected from a 
group consisting of a grape cell, a citrus cell, a beet cell, and a tobacco cell. 

54. A host cell according to claim 51 , wherein the protein or polypeptide is 
selected from a group consisting of a polyprotein, an RNA-dependent RNA 
polymerase, a heat shock 70 protein, a heat shock 90 protein, a diverged coat protein, 
and a coat protein. 

55. A transgenic plant cuhivar comprising the DNA molecule according to 
claim 22. 

56. A transgenic plant cultivar according to claim 55, wherein the plant 
cuhivar is selected from a group consisting of a grape plant cultivar, a citrus plant 
cultivar, a beet plant cultivar, and a tobacco plant cultivar. 

57. A transgenic plant cultivar according to claim 55, wherein the protein or 
polypeptide is selected from a group consisting of a polyprotein, an RNA-dependent 
RNA polymerase, a heat shock 70 protein, a heat shock 90 protein, a diverged coat 
protein, and a coat protein. 

58. A method of imparting grapevine leafroU virus resistance to a Vitis scion 
or rootstock cultivar or a Nicotiana cultivar comprising: 

transforming a Vitis scion or rootstock cultivar or a Nicotiana cultivar with a 
DNA molecule according to claim 22. 

59. A method according to claim 58, wherein the protein or polypeptide is 
selected from a group consisting of a polyprotein, an RNA-dependent RNA 
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polymerase, a heat shock 70 protein, a heat shock 90 protein, a diverged coat protein, 
and a coat protein. 

60. A method according to claim 58, wherein the grapevine leafroll virus is 
GLRaV-2. 

61. A method according to claim 58, wherein said transforming is 
Agrobacteriwn mediated. 

62. A method according to claim 58, wherein said transforming comprises: 
propelling particles at grape or tobacco plant cells under conditions effective 

for the particles to penetrate into the cell interior and 

introducmg an expression vector comprising the DNA molecule into the cell 
interior. 

63. A method of imparting beet yellows virus resistance to a beet cultivar 
comprising: 

transforming a beet cultivar with a DNA molecule according to claim 22. 

64. A method according to claim 63, wherein the protein or polypeptide is 
selected from a group consisting of a polyprotein, an RNA-dependent RNA 
polymerase, a heat shock 70 protein, a heat shock 90 protein, a diverged coat protein, 
and a coat protein. 

65. A method according to claim 63, wherein said transforming is 
Agrobacterium mediated. 

66. A method according to claim 63, wherein said transforming comprises: 
propelling particles at beet plant cells under conditions effective for the 

particles to penetrate into the cell interior and 

introducing an expression vector comprising the DNA molecule into the cell 
interior, 

67. A method of imparting tristeza virus resistance to a citrus scion cultivar or 
rootstock cultivar comprising: 
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transforming a citrus scion cultivar or rootstock cultivar with a DNA molecule 
according to claim 22. 

68. A method according to claim 67, wherein the protein or polypeptide is 
selected from a group consisting of a polyprotein, an RNA-dependent RNA 
polymerase, a heat shock 70 protein, a heat shock 90 protein, a diverged coat protein, 
and a coat protein. 

69. A method according to claim 67, wherein said transforming is 
Agrobacterium mediated. 

70. A method according to claim 67, wherein said transforming comprises: 
propelling particles at citrus plant cells under conditions effective for the 

particles to penetrate into the cell interior and 

introducing an expression vector comprising the DNA molecule into the cell 
interior. 

71. An antibody or binding portion thereof or probe recognizing the protein or 
polypeptide according to claim 1 . 

72. An antibody or binding portion thereof or probe according to claim 71 , 
wherein the protein or polypeptide is selected from a group consisting of a polyprotein, 
an KNA-dependent RNA polymerase, a heat shock 70 protein, a heat shock 90 protein, 
a diverged coat protein, and a coat protein. 

73. A method for detection of grapevine leafroll virus in a sample, said 
method comprising: 

providing an antibody or binding portion thereof recognizing the protein or 

polypeptide according to claim 1; 

contacting the sample with the antibody or binding portion thereof; and 
detecting any reaction which indicates that grapevine leafroll virus is present in 

the sample using an assay system. 

74. A method according to claim 73, wherein the assay system is selected 
from a group consisting of enzyme-lirLked immunoabsorbent assay. 
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radioimmunoassay, gel diffusion precipitin reaction assay, immunodiffusion assay, 
agglutination assay, fluorescent immunoassay, protein A immunoassay, and 
immunoelectrophoresis assay. 

75. A method for detection of grapevine leafroll virus in a sample, said 
method comprising: 

providing a nucleotide sequence of the DNA molecule according to claim 22 as 
a probe in a nucleic acid hybridization assay; 

contacting the sample with the probe; and 

detecting any reaction which indicates that grapevine leafroll virus is present in 
the sample. 

76, A method according to claim 75, wherein the nucleic acid hybridization 
assay is selected from a group consisting of dot blot hybridization, tissue printing, 
southern hybridization, and northern hybridization. 

0 77. A method for detection of grapevine leafroll virus in a sample: 

providing a nucleotide sequence of the DNA molecule according to claim 22 as 
a probe in a gene amplification detection procedure; 
contacting the sample with the probe; and 

detecting any reaction which indicates that grapevine leafroll virus is present in 
the sample. 

78. A method according to claim 77, wherein the gene amplification detection 
procedure is selected from a group consisting of polymerase chain reaction and 
immunocapture polymerase chain reaction. 
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